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Novel fusion proteins comprise an E. coli TrpLE protein 







segment, optionally a cleavable polypeptide linker, optionally a 
three-frame linker oligopeptide, and a heterologous polypep- 
tide. 
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METHOD FOR PRODUCING FUSION PROTEINS 
10 — 



This invention relates to a novel method for 
producing fusion proteins using the TrpLE gene of 
E. colj . to the fusion proteins so obtained and their 
uses, and to the modified plasmids used to obtain said 
proteins. 

The technical and economical advantages of using 
recombinant ONA methodology to produce useful proteins 
and polypeptides from essentially any source in another 
organism, such as E. coli , are now well established; In 
general, the method comprises the construction of a 
self-replicating vector capable of achieving a high copy 
number within a host cell, in which a strong 
promoter-operator controls the expression of a desired 
heterologous gene. Subsequent transformation of a host 
(e.g., E. coli ) with the vector, growth of the host 
culture to a high density and induction of the promoter 
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then leads to production of the heterologous 
polypeptide. High yields of the desired polypeptide 
naturally require less purification to produce a 
homogeneous product, but the varied phy siochemical 
properties of each specific polypeptide generally require 
an individualized purification scheme for each 
polypeptide . 

Much emphasis has been given to the characterization 
of strongly inducible promoter-operator systems which 
allow for controlled overproduction of a heterologous 
polypeptide at an optimal time during the growth of the 
host cells. A particularly well-studied case is that of 
the Trp promoter-operator. See I. P. Crawford and G . V. 
Stauffer, Ann. Rev. Biochem. , 49 , 163-195 (1980). 
Biosynthesis of tryptophan in E. coli is catalyzed by a 
series of five enzymes referred to as TrpA, TrpB, TrpC, 
TrpD, and TrpE. The genes encoding these enzymes lie 
adjacent to and are controlled by the Trp promoter- 
operator, with the TrpE gene nearest the Trp leader. A 
series of deletion mutants have been isolated in which 
the amino terminus of the Trp leader peptide is fused to 
the carboxy portion of the TrpE protein so as to produce 
various leader-TrpE ("LE") proteins. See K. Bertrand, et 
al. f J. Mol. Biol. . 103, 319-337 (1976); G. F. Miozzari 
and C Yanofsky, J. Bact. , 133 , 1457-1466, (1978). 
Because these LE proteins can be produced at levels 8-10 
fold higher than the TrpE protein in the wild type 
configuration, these altered Trp genes have been used to 
produce a number of heterologous polypeptides. See 0. G. 
Kleid, et al., GB Pat. No. 2,073,203; B. P. Nichols and 
C. Yanofsky, Meth. Enzymol. , 101 , 155-164, (1983). 

The intracellular synthesis of a fusion protein by 
expression of a heterologous gene of interest fused to a 
well-expressed host gene is a favored means of achieving 
high levels of expression of a heterologous protein. For 
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the production of small polypeptide hormones in E. coli, 
this approach has permitted both increased levels of 
expression and increased stability of the fusion protein 
compared to the unfused hormones. See K. Itakura et al. , 
Science , 198 , 1056-1063, (1977); D. Goeddel et al., 
Nature , 281 , 544 r 548, (1979); J. Shine et al., Nature , 
285 , 456-461, (1980). It has been reported that fusion 
polypeptides of the TrpLE protein with either the 
foot-and-mouth disease virus capsid protein VP^ (D. G . 
Kleid, et al. / Science , 214, 1125-1129 (1981)) or with 
the influenza virus hemaglutinin protein HA (A. R. Davis 
et al., Gene , 21, 273-284 (1983)) are well-expressed. 
These fusion proteins are also extremely insoluble. 
Since the two viral proteins readily aggregate normally, 
it was thought that the aggregation of TrpLE fusion 
proteins was due to the physiochemical properties of the 
heterologous protein. 

We have now discovered that, in addition to high 
levels of expression, the TrpLE protein can confer 
properties upon the fusion protein which are useful in 
the purification of the heterologous polypeptide. In 
particular, replacement of a variable portion of the LE 
carboxy terminal region with a heterologous polypeptide, 
even a small, soluble polypeptide, results in a 
self -aggregating fusion protein. Purification of the 
fusion protein is greatly simplified since the aggregates 
are easily enriched and concentrated in the pellet 
obtained by differential centrif ugation of a cell 
lysate. This, useful purification step can thereby be 
used with a variety of heterologous genes. Since the 
specific fusion site can be varied within the TrpLE gene 
without loss of the desired self-aggregation property, 
convenient restriction sites within the TrpLE gene can be 
used to simplify the construction of the desired plasmid 
vectors. This flexibility in the choice of fusion site 
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allows other alterations in the TrpLE protein (as 
described herein) to enhance the purification of the 
heterologous polypeptide. 

In addition, by using a "three-frame linker" coupled 
to the TrpLE gene, one may obtain a desired heterologous 
polypeptide regardless of its triplet reading frame (as 
well as the two, out-of -frame "nonsense" polypeptides). 
Without the three-frame linker, one is forced to either 
select a fusion site that provides the proper reading 
frame orientation or to use a complicated cloning 
strategy, thus limiting the number of possible fusion 
sites. By using a three-frame linker, one may obtain the 
desired polypeptide easily even where the proper 
("sense") reading frame is unknown. 

Certain fusion proteins different from the proteins 
of the invention have been made using TrpLE genes in a 
distinctly different process. See, e.g., EPO patent 
applications 36,776 and 114,506. 

A major advance in understanding the basis of 
Acquired Immunodeficiency Syndrome (AIDS) has been the 
isolation of a novel class of retrovirus from patients 
suffering from AIDS or AIDS-Related Complex (ARC). This 
group of viruses is variously called lymphadenopathy 
virus (LAV) (S. Wain-Hobson, et al., Cell , 40, 9-17 
(1985)), Human T Lymphotropic Virus III (HTLV-III) (L. 
Ratner, et al., Nature , 313 , 277-84 (1985)) and 
AIDS-Associated Retrovirus (ARV) (R. Sanchez-Pescador , et 
al., Science , 227 , 484-92 (1985)). From a morphologic 
and biologic standpoint, this group of viruses is most 
closely related to the lentivirus family. So far, more 
than 100 isolates have been obtained by different 
research groups. Restriction enzyme analysis of proviral 
DNAs suggests a high degree of sequence polymorphism 
among various isolates. Comparison of nucleotide 
sequences between a limited number of cloned isolates 
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suggests that sequence variability is particularly 

pronounced within the envelope gene. 

We have sought to evaluate the significance of these 

structural differences among AIOS patients by comparing 

their individual immunological responses to viral 

infection. Serum samples from healthy individuals as 

well as ARC and AIOS patients were screened for 

immunoreactivity with cloned AIDS viral proteins using 

western blot analysis. Previous workers have used 

proteins derived from infectious AIDS viral particles in 

their assays. See, e.g., M. Sarngadharan, et al., 

Science , 224 , 506-08 (1984); 3. Schupbach, et al., 

Science , 224, 503-05 (1984). Our approach was to first 

clone and express subgenomic segments of the AIDS virus 

15 in coli using the TrpLE vectors described above in 

order to provide specific recombinant proteins or protein 

fragments for use as antigens in subsequent serological 

tests. The reagents so generated, in addition to being 

useful in diagnosis of viral infection, are potentially 

valuable as vaccine components. Other researchers have 

used less advantageous cloning vectors and methods to 

clone portions of the AIDS virus genome. See, for 

example, R. Gallo, et al., Science , 228 , 93-96 (1985); R. 

Gallo, et al., Nature , 315 , 151-154 (1985); F. 

_ Wong-Staal, et al., Cell , 41, 979-986 (1985). 
25 

Here we describe the construction of seven 
recombinant clones which span most of the AIOS viral 
genome. Upon induction, novel fusion proteins were 
produced that react immunologically with sera obtained 
^ from AIDS patients. We also disclose experiments in 
which the recombinant proteins were employed to screen 
152 sera in order to establish which of the seven clones 
are most useful in detecting antibodies against the AIDS 
virus. 
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We have also discovered that fragments of certain 
AIDS-TrpLE fusion proteins obtained by cyanogen bromide 
digestion retain the immunological activity of the intact 
protein. By using small but immunologically active 
fragments of the antigenic proteins, purification 
problems are greatly simplified. Background reactivity 
is decreased and sensitivity is thereby increased. 

The following terms appearing in the Specification 
and appended Claims are defined as follows: 

The term n GRF" stands for growth hormone releasing 
factor. HuGRF denotes human growth hormone releasing 
factor, whereas PGRF and BGRF denote the porcine and 
bovine varieties, respectively. 

The term "AIDS virus" as used herein refers to 
HTLV-III, ARV, LAV and related viruses, known to cause or 
suspected to cause AIDS, ARC, LAD, and related disorders. 

The term "HBV" refers to hepatitis B virus. 

The term "natural amino acid" refers to an amino 
acid from any source which is commonly found in nature, 
i.e., is a common constituent of naturally occurring 
proteins. Examples of natural amino acids are glycine, 
alanine, tryptophan, histidine, cysteine, methionine, 
phenylalanine, etc. An amino acid is "natural" whether 
it is extracted from biological sources or manufactured 
using chemical techniques. 

The term "synthetic amino acid" refers to amino 
acids which are known in the art, but are not normally 
constituents of proteins derived from biological 
sources. The term "synthetic amino acids" includes, 
without limitation, such amino acids as norleucine, 
homoarginine, homoserine, y-aminobutyric acid, and the 
like. 
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The term "sequence," as applied to a DNA sequence or 
a polypeptide sequence, refers to a DNA molecule or a 
polypeptide molecule in which the nucleotide or amino 
acid monomers are arranged in a particular order. 

A term in the form "TrpLE # ( restriction enzyme)" 
refers to the TrpLE deletion mutant DNA sequence from the 
5' end up to the corresponding restriction enzyme 
recognition site. Thus, the term "TrpLE*Sst II" refers 
to the E. coli DNA sequence encoding the Trp promoter and 
the E protein up to the first Sst II restriction site. 
Similarly, the term "TrpLE*BssH II" refers to the 
E. coli ON A sequence encoding the TrpLE protein up to the 
first BssH II restriction site. 

A term in the form "TrpLE- (restriction enzyme)- 
(heterologous sequence)" refers to the TrpLE deletion 
mutant DNA sequence encoding, from 5* to 3', the Trp 
promoter and the TrpLE protein up to the first 
corresponding restriction enzyme recognition site named, 
optionally followed by a three-frame linker segment, 
cleavable linker sequence, or both, with the heterologous 
20 DNA coding for the desired heterologous protein inserted 
at the restriction site of the three-frame linker (if 
present), at the 3' end of the cleavable linker sequence 
(if present), or at the 3' end of the TrpLE sequence. 
For example, the term "TrpLE-Sst II-HuGRF" refers to 
25 the DNA sequence which codes for the truncated LE 

protein, optionally a three-frame linker segment and/or a 
cleavable linker sequence, and human growth hormone 
releasing factor. 

A term in the form "LE-(heterologous polypeptide)" 
30 refers to the fusion protein produced by expressing the 
ONA sequence TrpLE«(restriction enzyme)-(heterologous 
sequence). For example, LE-HuGRF Leu27 refers to the 
fusion protein obtained by expressing either 
TrpLE-Sst II " HuGRF Leu27 ( com P risin 9 the LE 
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polypeptide up to the Sst II site), or TrpLE *BssH II- 
HuGRF Leu27 (comprising the LE polypeptide up to the 
BssH II site), optionally an oligopeptide encoded by a 
cleavable linker or a three-frame linker, and human 
growth releasing factor modified by replacing the amino 
acid' at position 27 with leucine. 

The term "cleavable linker sequence" refers to a 
nucleotide sequence which encodes for a peptide sequence 
which is easily cleavable by methods known in the art. 
For example, polypeptides containing methionine are 
easily cleaved using cyanogen bromide (CNBr), Asp-Pro 
peptide sequences are cleaved by acid hydrolysis, and 
Asp-Asp-Asp-Asp-Lys sequences are cleaved by the 
endopeptidase enterokinase . See, e.g., A. Light, et al., 
Anal. Biochem. , 106 , 199-206 (1980).. To avoid denaturing 
the heterologous polypeptide, one should avoid using 
cleavable linker sequences which encode cleavable peptide 
sequences that are also present in the heterologous 
polypeptide. For example, if the heterologous protein 
contains internal Asp-Pro links, it is preferred to use a 
cleavable linker sequence that does not code for 
Asp-Pro. However, we have now discovered that LE-KAL-10 
fusion proteins may be cleaved with CNBr, despite the 
presence of methionine residues within the KAL-10 
polypeptide, to yield certain polypeptide fragments which 
exhibit the same immunoreactivity as the intact KAL-10 
polypeptide. 

The term "three-frame linker" or "three-frame linker 
sequence" refers to a set of one or more DNA sequences 
which, when inserted into the TrpLE coding segment of the 
plasmid vector, allows the subsequent insertion and 
expression of the heterologous sequence in any of the 
three reading frames. The term "three-frame linker 
segment" refers to the portion of the three frame linker 
remaining between the TrpLE gene and the heterologous 
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sequence after the latter is inserted. In the Examples 
herein, 8, 10, and 12 base ONA sequences, each containing 
the recognition site of a particular restriction enzyme, 
were inserted into the TrpLE sequence. The heterologous 
sequence is then inserted at the enzyme recognition site 
in the three-frame linker, resulting in plasmids that 
have TrpLE-(l base)-(restriction site)-heterologous 
sequence, TrpLE-(2 bases)-( restriction site)-heterologous 
sequence, and TrpLE-(3 bases)-(restriction site)- 
heterologous sequence. Thus, the heterologous sequence 
may be expressed from this plasmid in each of the three 
reading frames, one "sense" frame (which encodes the 
desired polypeptide) and two "nonsense" frames. After 
expression, the sense polypeptide is easily identified 
using immunological reagents. 

The term "heterologous sequence" refers to a DNA 
molecule which encodes a protein or protein segment not 
native to the expression vector. Exemplary heterologous 
sequences encode polypeptides including, without 
limitation, human, bovine, or porcine growth hormone 
releasing factors, growth factors, bovine growth hormone, 
and antigenic viral or bacterial proteins, particularly 
AIDS viral proteins. 

As applied to a cleaved dsONA segment, the term "3* 
end" refers to the end of the dsDNA segment which would 
correspond to the 3 1 end of the mRNA which would be 
transcribed from the "sense" or coding strand. 



One aspect of the invention is a bacterial 
expression vector for the expression of fusion proteins, 
comprising a TrpLE ONA gene fragment, an optional three 
frame linker, an optional cleavable linker, and a 
heterologous DNA sequence. Where both a three frame 
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linker and a cleavable linker are used, they may appear 
in either order. 

Another aspect of the invention is a method for 
producing a bacterial expression vector for the 
5 expression of fusion proteins, which method comprises 

providing a plasmid containing a TrpLE DNA gene, cleaving 
the TrpLE DNA gene with a restriction endonuclease, and 
inserting a heterologous DNA sequence. One may insert 
the optional three frame linker before or after the 

10 optional cleavable linker- The optional cleavable linker 
sequence may be inserted at any point in the method. 

Another aspect of the invention is a fusion protein 
comprising a TrpLE polypeptide fragment, optionally a 
three frame linker segment, optionally a cleavable 

15 linker, and a heterologous polypeptide, or a fragment or 
derivative thereof. Where both a three frame linker and 
a cleavable linker are employed, they may appear in 
either order. 

Another aspect of the invention is a method for 

2Q inducing antibody formation in mammals by administering 
an effective amount of a fusion protein of the 
invention. The antibodies may be either polyclonal or 
monoclonal and may be used for detecting the presence of 
a heterologous polypeptide or for conferring immunity to 

2g viral, bacterial, or fungal infection. 

Another aspect of the invention is a method for 
reducing or preventing viral infection in a mammal by 
competitive binding to cellular viral receptors by 
administering an effective amount of a fusion protein of 
the invention. 

Another aspect of the invention is an immunoassay 
reagent for binding antibodies from mammals exposed to 
viruses, bacteria, or fungi, wherein said reagent is a 
fusion protein of the invention. 
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Another aspect of the invention is a vaccine for 
immunizing an animal against infection which vaccine 
comprises: a fusion protein of the invention or a portion 
thereof with a pharmaceutical^ acceptable excipient, 
preferably an adjuvant. 

Another aspect of the invention is the AIDS virus 
ENV fragments KAL-A and KAL-B. 



The manner in which these and other objects and 
advantages of the invention are obtained will become more 
apparent from the detailed description which follows and 
from the accompanying drawings, in which: 

Figure 1 illustrates the construction of an 
intermediate vector plasmid pTrpE-#22. 

Figure 2 illustrates the construction of the 
expression vector pTrpLE-#14. 

Figure 3 illustrates the ONA and amino acid 
sequences predicted to encode the mRNA produced from the 
plasmid pTrpLE*Sst "-HuGRF^^ and plasmid 
pTrpLE^Sst II " H uGRF Leu27 . The terminal sequences are 
based on information available in the literature, Amino 
acids of the HuGRF segment are shown in capital letters. 

Figure A illustrates a flow chart of the 
purification scheme used to produce HuGRF Leu27 ^ rom 
E. coli cells. 

Figure 5 depicts an SDS protein gel demonstrating 
high production of the TrpLE, TrpLE-BssH I*- HuGRF Leu2 7 
and TrpLE-Sst II-HuGRF Leu2? proteins from E . coli cells. 

Figure 6 depicts an SOS protein gel demonstrating 
the purification of TrpLE-BssH **- H uGRF Leu2? from 
E. coli by cell lysis and centrif ugation before CNBr 
cleavage. 

Figure 7 depicts HPLC profiles of peptides released 
by CNBr treatment of TrpLE-Sst II " H uGRF Leu27 (A) and 
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TrpLE-BssH H-HuGRF Leu27 (B). The HPLC fractions in 
(B) were assayed for GRF activity by activation of 
adenylate cyclase, and the results presented in (C). The 
arrow in (A) indicates the fraction with peak activity. 

Figure 8 depicts a western blot of TrpLE-BssH II- 
HuGRF Leu27 (A) and TrpLE-Sst II-HuGRF^ eU 2y (B) using 
anti-HuGRF antisera, after CNBr cleavage and HPLC 
fractionation of the fusion proteins. 

Figure 9 illustrates the purity of several 
HuGRF Leu 2 7 preparations by Coomassie Brilliant Blue 
stain (A), and western blotting with anti-HuGRF 
antisera. Authentic HuGRF (S) is compared to 
HuGRF Leu27 purified from two lots of 
TrpLE-BssH H-HuGRF Leu27 (lanes 1, 2) and 
TrpLE-Sst II - H uGRF Leu27 (lane 3). Unrftarked lanes 
contain molecular weight standards. 

Figure 10 depicts the HPLC profiles of lots 1, 2, 
and 3 described in Figure 9. 

Figure 11 is a map of the AIDS virus genome 
illustrating the locations of the specific clones 
described in relation to the major open reading frames of 
the virus (GAG, POL, ENV, A, and B). 

Figure 12 illustrates a reference pattern of 
TrpLE-AIDS fusion proteins and AIDS virus infected cells 
(BAG) visualized by western blotting with a pool of sera 
from three AIDS patients. 

Figure 13 illustrates results from the western 
blotting analysis of 19 human serum samples. Each serum 
sample was used to probe the various TrpLE-AIDS fusion 
proteins (GAG, POL, ENV), the TrpLE vector without an 
insert (VECTOR), and AIDS virus infected cells (AIDS). 
The numbers above each panel Indicate the number of 
positively reacting serum samples. 

Figure 14 illustrates Western blot results comparing 
the immunoreactivity of CNBr treated LE-GAG fusion 
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proteins generated by pSB8-Gag-#25 with LE-ENV fusion 
proteins produced by pTrpLE-KAL-ENV-#10 # 

Figure 15A shows an SDS-polyacrylamide gel stained 
with Coomasie Blue stain showing the proteins produced by 
pSB12-854AH8. Figure 15B depicts western blot analysis 
of the same E. coli extracts of pSB-854AHB 
demonstrating that the 35 Kd protein is the only 
immunoreactive protein entity. 

Figures 16A and 16B depict the HPLC fractionation 
and western blot analysis of the CNBr-cleaved fragments 
obtained in Example 16. 

One aspect of the invention is a bacterial 
expression vector for the expression of fusion proteins, 
which vector comprises a TrpLE DNA gene fragment and a 
heterologous DNA sequence. A preferred subgenus is the 
vector which further comprises a three-frame linker 
segment between said TrpLE fragment and said heterologous 
DNA sequence. A preferred class of the invention is the 
vector which further comprises a cleavable linker 
sequence between said TrpLE fragment and said 
heterologous DNA sequence, especially where said 
cleavable linker encodes Met, Asp-Pro, or 
Asp-Asp-Asp-Asp-Lys. A preferred subclass is the vector 
wherein said heterologous DNA sequence encodes: human, 
bovine, or porcine growth hormone releasing factor or a 
biologically active derivative or fragment thereof; an 
antigenic viral protein or a biologically active 
derivative or fragment thereof; or a growth factor or 
hormone or a biologically active derivative or fragment 
thereof. A preferred species is the vector encoding an 
antigenic viral protein in which the antigenic viral 
protein is an AIDS protein, particularly an ENV 
polypeptide, and especially the polypeptide KAL-10. 
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Another preferred species is that wherein the antigenic 
viral protein is a pre-S sequence of the surface antigen 
of HBV (hepatitis B virus). Another preferred species is 
the vector encoding a growth hormone releasing factor 
5 derivative, especially where the derivative is 

HuGRF Leu27' BGRF Leu27' or PGRF Leu27' Another 
preferred species is the vector encoding a growth 

hormone, especially where the growth hormone is bovine 

growth hormone. Presently preferred embodiments of the 

10 invention are the plasmids pTrpLE-KAL, 

pTrpLE-KAL-ENV-#10, pSB12-854, pSB12-854AHB , 

pTrpLE-Sst H-HuGRF Leu27 , 

pTrpLE-BssH H-HuGRF Leu27 -#5, pTrpLE-Sac II-PGRF, 
and 

15 pTrpLE BssH H-BGRF Leu27#15j as def - ned in £ he examples. 

Another aspect of the invention is a method for 
producing a bacterial expression vector for the 
expression of fusion proteins, which method comprises: 
2Q providing a plasmid containing a TrpLE DNA 

gene; 

cleaving the TrpLE ONA gene with a restriction 
endonuclease; and 

inserting a heterologous DNA sequence. A 

2s preferred subgenus is the method which further comprises 
inserting a cleavable linker sequence before or after 
inserting the heterologous sequence. A preferred class 
of the invention is the method wherein the cleavable 
linker sequence encodes Met, Asp-Pro, or 

3Q Asp-Asp-Asp-Asp-Lys, especially Met or Asp-Pro. A 
preferred subclass is the method wherein said 
heterologous DNA sequence encodes: human, bovine, or 
porcine growth hormone releasing factor or a biologically 
active derivative or fragment thereof; an antigenic viral 

35 protein or a biologically active derivative or fragment 
thereof; or a growth factor or hormone or a biologically 
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active derivative or fragment thereof. A preferred 
species is the method wherein the heterologous sequence 
encodes an antigenic viral protein in which the antigenic 
viral protein is an AIDS protein, particularly an ENV 
polypeptide, and especially the polypeptide KAL-10. 
Another preferred species is that wherein the antigenic 
viral protein is a pre-S sequence of HBV (hepatitis B 
virus). Another preferred species is the method wherein 
the heterologous sequence encodes a growth hormone 
releasing factor derivative, especially where the 
derivative is HuGRF Leu27 , BGRF Leu27 , or PGRF Leu2? . 
Another preferred species is the method wherein the 
heterologous sequence encodes a growth hormone, 
especially where the growth hormone is bovine growth 
hormone. 

Another preferred subgenus is the method* for 
producing a bacterial expression vector for the 
expression of fusion proteins, which method comprises: 
selecting a plasmid containing a TrpLE DNA 

gene; 

cleaving the TrpLE DNA gene with a restriction 
endonuclease; 

inserting a three frame linker; and 
inserting a heterologous DNA sequence within 
the three frame linker. A preferred subclass is the 
method wherein said heterologous DNA sequence encodes: 
human, bovine, or porcine growth hormone releasing factor 
or a biologically active derivative or fragment thereof; 
an antigenic viral protein or a biologically active 
derivative or fragment thereof; or a growth factor or 
hormone or a biologically active derivative or fragment 
thereof. A preferred species is the method wherein the 
heterologous sequence encodes an antigenic viral protein 
in which the antigenic viral protein is an AIDS protein, 
particularly an ENV polypeptide, and especially the 
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polypeptide KAL-10, Another preferred species is that 
wherein the antigenic viral protein is a pre-S sequence 
of HBV (hepatitis B virus). Another preferred species is 
the method wherein the heterologous sequence encodes a 
growth /hormone releasing factor derivative, especially 
where the derivative is HuGRF Leu27' BGRF Leu27* or 
PGRF Leu27 . Another preferred species is the method 
wherein the heterologous sequence encodes a growth 
hormone, especially where the growth hormone is bovine 
growth hormone. 

Another aspect of the invention is a fusion protein 
comprising a TrpLE polypeptide fragment and a 
heterologous polypeptide, or a fragment or derivative 
thereof. A preferred subgenus of the invention is the 
fusion protein which further comprises a three-frame 
linker oligopeptide segment between the TrpLE polypeptide 
fragment and the heterologous polypeptide. A preferred 
subclass is the fusion protein wherein the heterologous 
polypeptide is: human, bovine, or porcine growth hormone 
releasing factor or a biologically active derivative or 
fragment thereof; an antigenic viral protein or a 
biologically active derivative or fragment thereof; or a 
growth factor or hormone or a biologically active 
derivative or fragment thereof. A preferred species is 
that wherein the antigenic viral protein is a pre-S 
sequence of HBV (hepatitis B virus). Another preferred 
species is the fusion protein wherein the heterologous 
polypeptide is a growth hormone releasing factor 
derivative, .especially where the derivative is 

HuGRF Leu27' BGRF Leu27' or PGRF Leu27' Another 
preferred species is the fusion protein wherein the 
heterologous polypeptide is a growth hormone, especially 
where the growth hormone is bovine growth hormone. 
Another preferred species is the fusion protein wherein 
the heterologous polypeptide is an antigenic viral 
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protein in which the antigenic viral protein is an AIDS 
protein, particularly an ENV polypeptide, and especially 
the polypeptide KAL-10. A particularly preferred species 
is an immunoreactive KAL-10 fragment, especially KAL-A, 
KAL-B, or a mixture of the two, where KAL-A and KAL-B 
have the following sequences: 



Kal-A ; 

Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys 
Tyr Lys Val Val Lys He Glu Pro Leu Gly Val 
Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin 
10 Arg Glu Lys Arg Ala Val Gly He Gly Ala Leu 

Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr X 

Kal-B : 

Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu 
Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu 
Arg Ala He Glu Ala Gin Gin His Leu Leu Gin 
15 Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala 

Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp 
Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly 
Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn 
Ala Ser Trp Ser Asn Lys Ser Leu Glu Gin He 
Trp Asn Asn X 

^ wherein X represents homoserine lactone or its 

20 

hydrolysis or amidation products, methionine, or another 
naturally occurring amino acid. Other subsets and 
analogs of the two sequences set forth above have 
immunological activity and are within the scope of the 
^ invention. Subsets are generated from the above 
^ sequences by deleting amino acyl residues from either or 
both ends of the sequence, such that the resulting 
polypeptide retains at least substantially the same 
immunoreactiyity . Analogs are generated by replacing up 
to three amino acids with other natural or synthetic 

30 

amino acids known to those skilled in the art by 
recombinant or chemical methods, such that the resulting 
polypeptide retains at least substantially the same 
immunoreactivity . Analogs include subsets and sequences 
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which have up to three amino acyl residues within the 
sequence deleted. 

Another preferred subgenus of the invention is the 
fusion protein which further comprises a cleavable linker 
oligopeptide between the TrpLE polypeptide fragment and 
the heterologous polypeptide. A preferred class of the 
invention is the method wherein the cleavable linker 
oligopeptide is Met, Asp-Pro, or Asp-Asp-Asp-Asp-Lys , 
especially Met or Asp-Pro. A preferred subclass is the 
fusion protein wherein the heterologous polypeptide is: 
human, bovine, or porcine growth hormone releasing factor 
or a biologically active derivative or fragment thereof; 
an antigenic viral protein or a biologically active 
derivative or fragment thereof; or a growth factor or 
15 hormone or a biologically active derivative or fragment 
thereof. A preferred species is the fusion protein 
wherein the heterologous polypeptide is an antigenic 
viral protein in which the antigenic viral protein is an 
AIDS protein, particularly an ENV polypeptide, and 
especially the polypeptide KAL-10. Another preferred 
species is that wherein the antigenic viral protein is a 
pre-S sequence of HBV (hepatitis B virus). Another 
preferred species is the fusion protein wherein the 
heterologous polypeptide is a growth hormone releasing 
factor derivative, especially where the derivative is 

HuGRF Leu27' BGRF Leu27' or PGRF Leu2y An0ther 
preferred species is the fusion protein wherein the 

heterologous polypeptide is a growth hormone, especially 

where the growth hormone is bovine growth hormone. 

30 Another aspect of the invention is a method for 

inducing antibody formation in mammals by administering 
an effective amount of a fusion protein or the 
invention. A preferred subgenus is the method wherein 
said antibodies are polyclonal or monoclonal and are 

35 useful for detecting the presence of a heterologous 
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polypeptide. A preferred class of the invention is the 
method in which the heterologous polypeptides to be 
detected are viruses. A preferred species is that 
wherein the virus is AIDS virus or HBV. Another 
preferred subgenus is the method wherein said antibodies 
are neutralizing antibodies useful for conferring 
immunity to viral, bacterial, or fungal infection. A 
preferred species is that in which the antibodies confer 
immunity to AIDS. Another preferred species is that in 
which the antibodies confer immunity to HBY. 

Another aspect of the invention is a method for 
reducing or preventing viral infection in a mammal by 
competitive binding to cellular viral receptors by 
administering an effective amount of a fusion protein, 
wherein said fusion protein comprises a TrpLE polypeptide 
fragment and a virus polypeptide recognized by a 
mammalian cell virus receptor. A preferred species is 
the method in which the virus polypeptide is an AIDS 
virus polypeptide. 

Another aspect of the invention is an immunoassay 
reagent for binding antibodies from mammals exposed to 
viruses, bacteria, or fungi, wherein said reagent is a 
fusion protein comprising a TrpLE polypeptide fragment 
and an antigenic viral, bacterial or fungal polypeptide. 
A preferred class is the immunoassay reagent wherein said 
polypeptide is a viral polypeptide. A preferred subclass 
is the reagent wherein the viral polypeptide is an ENV or 
surface protein or a fragment thereof. A preferred 
species is the reagent wherein the ENV protein is an AIDS 
protein fragment, especially KAL-10, A particularly 
preferred species is an immunoreactive KAL-10 fragment, 
especially KAL-A, KAL-B, or a mixture of the two, or a 
subset or analog of KAL-A or KAL-B. Another preferred 
species is the reagent wherein the surface protein is the 
pre-S sequence of the surface antigen of HBV. 
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TrpLE 

The fusion between the DNA sequences encoding the LE 
protein and the heterologous polypeptide is made in a 
manner such that various distal portions of the LE 
protein will be replaced, in the same translational 
reading frame, by the heterologous polypeptide. 
Additionally, other amino acid insertions, deletions and 
substitutions in the LE protein may be introduced so as 
to facilitate the purification of the desired product. 

Cells are transformed by addition of the expression 
vehicles of the invention and grown under conditions in 
which the exogenous promoter-operator is repressed. For 
instance, in the case of the E. coli tryptophan 
promoter-operator (Trp promoter), cells are grown in the 
presence of added tryptophan. This represses the Trp 
promoter such that cell growth can proceed normally, 
without deleterious effect due to overexpression of the 
fusion protein. When the recombinant culture reaches a 
cell density appropriate for industrial production of the 
polypeptide, the external source of repression (e.g., 
tryptophan) is removed, or, alternatively, an external 
inducer (e.g., indole acrylic acid) is added. This 
derepresses the exogenous promoter-operator (e.g., Trp 
promoter) and highly efficient expression of the 
heterologous insert occurs. 

The invention provides a variety of useful 
intermediates and end products, including heterologous 
polypeptide-containing fusion proteins which can be 
easily separated from the remainder of the bacterial 
proteins by virtue of the self-aggregating properties 
conferred by the LE protein portion. The fusion protein 
may then be specifically cleaved to release the desired 
heterologous polypeptide, whose purification is 
facilitated by the choice of the fusion site within the 
LE protein. Alternatively, one may use the fusion 
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protein without cleavage, e.g., to produce vaccines or 
monoclonal antibodies for immunoassays. 

The original isolation of the TrpLE protein cloning 
region was the result of genetic manipulations of the 
g E. coli tryptophan promoter-operator. See K. Bertrand, 
C Squires and C, Yanofsky, J. Mol. Biol. . 103 , 319-337 
(1976); 6. F. Miozzari and C. Yanofsky, J. Bact .. 133, 
1457-1466 (1978). One of these original deletions, 
TrpLE1413, is available on plasmid pVVl (B. P. Nichols 
and C. Yanofsky, Meth. Enzymol. , 101, 155-164 (1983)) and 
was used to construct the LE protein vectors employed in 
the examples herein. The essential features of a ONA 
deletion required to form an effective LE protein vector 
for the purposes of this invention are, relative to the 
Trp promoter-operator (C. Yanofsky et al., Nuc. Acids 
Res., 9, 6647-6668 (1981)): 

a 5» terminal breakpoint near a base coding 
for amino acid #10 of the TrpL peptide; 

a 3' terminal breakpoint about two-thirds of 
the way through the TrpE coding region, preferably 
near amino acid #339. 

Such a deletion removes the Trp promoter-operator 
attenuator region, allowing for higher level of gene 
expression, and preserves a region of the TrpE protein 
from about amino acid #339 to the carboxy terminal amino 
acid, which is preferred for its ability to 
self-aggregate even when heterologous proteins are 
substituted for a portion of the carboxy coding region. 

A preferred embodiment of the invention is the LE 
B vector plasmid pTrpLE-#14. This is a pBR322-based 

plasmid containing the £. coli Trp promoter-operator and 
LE coding region of 191 amino acids from pVVl, followed 
in turn by a portion of the TrpD gene and a transcription 
terminator fragment from the E. coli RNA polymerase-B 
subunit gene (RpoC: See C. Squires et al., Nucleic Acids 
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Res. , 9, 6827-6840 (1981)). The terminator provides for 
termination of mRNA synthesis, thereby preventing 
transcription beyond the gene of interest which could 
interfere with plasmid replication or stability. 
Transcripts of the terminator region may also form 
structures which stabilize the 3 1 end of the mRNA against 
degradation. 

The expression of the LE protein either alone or 
fused to a heterologous gene is conveniently brought 
about by the tryptophan promoter in plasmid pTrpLE-#14 
and its derivatives as described herein. It is 
appreciated, however, that any similar well-characterized 
promoter-operator DNA region known to be capable of 
regulating gene expression in E. coli could be 
substituted for the Trp promoter used in these particular 
examples. The promoter region of the Trp 
promoter-operator commends itself for this use because it 
is so well-characterized (I. P. Crawford and G. V. 
Stauffer, Ann. Rev. Biochem. , 49, 163-195 (1980)) and can 
be easily induced to produce high levels of mRNA and 
hence protein products by growth in media either lacking 
tryptophan or containing the inducer compound indole 
acrylic acid (IAA). Other suitable control elements 
might include the following, or selected regions thereof; 
the Lac promoter, the Tac promoter, the Arabinose, 
Galactose or Alkaline Phosphatase operons, or a 
promoter-operator from bacteriophage lambda. 

Likewise, the provision of a transcription 
termination function by the RpoC DNA fragment and the 
selectable marker of ampicillin resistance employed in 
pTrpLE-#14 were used by way of an example and are not 
meant to limit the scope of this invention. 
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GRF 

Growth hormone releasing factor (GRF) is a small (AA 

amino acid) polypeptide secreted by the hypothalamus 

which stimulates the release of growth hormone by the 

^ pituitary in mammals* GRF is useful for increasing 

weight gain in animals, and for increasing milk 

production in lactating mammals. GRFs from different 

species typically vary by a few amino acids in 

composition, as illustrated below: 

1 5 10 

10 HuGRF : Tyr Ala Asp Ala He Phe Thr Asn Ser Tyr 

PGRF : Tyr Ala — ffsp — Ala He Phe rhr Asn Ser Tyr 

BGRF : Tyr Ala Asp Ala He Phe Thr Asn Ser Tyr 

15 20 

Arg Lys Val Leu Gly Gin Leu Ser Ala Arg 

Erg Cyi Val Ceu Ely 5Tn Ceu Ser Ala Arg 

15 Arg Lys Val Leu Gly Gin Leu Ser -Ala Arg 

25 27 30 

Lys Leu Leu Gin Asp lie Met Ser Arg Gin 

Cys Ceu Ceu GTn Asp Tie Wei 5er ffrg Gin 

Lys Leu Leu Gin Asp He Met Asn Arg Gin 

35 AO 

20 Gin Gly Glu Ser Asn Gin Glu Arg Gly Ala 

GTn Gly GTu Erg &sn GTn GTu EIn Ely TTTa 

Gin Gly Glu Arg Asn Gin Glu Gin Gly Ala 

AA 

Arg Ala Arg Leu -NH? 
Erg Val Arg Ceu-NH 2 
25 Lys Val Arg Leu-NH 2 

Production in E. coli of human growth hormone 
releasing factor (HuGRF) was achieved by fusing the HuGRF 
DNA sequence. to the TrpLE sequence. In a preferred 
embodiment, the fusion between the HuGRF and the LE 
protein coding regions was via the codon for methionine 
(ATG), to allow for release of the HuGRF via specific 
cleavage at Met using CNBr. To prevent cleavage at the 
methionine in position 27 of HuGRF, position 27 was 
changed to a leucine in the synthetic DNA which was 

36 
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cloned into the LE protein vector. Leucine was chosen 

since it is most often substituted for methionine in 

naturally occurring mutations, but other naturally 

occurring amino acids may be substituted to prevent CNBr 

cleavage. The synthetic DNA was also designed to 

maximize the use of amino acid codons which are favored 

in highly expressed E. coli proteins (R. Grantham et al., 

Nucleic Acids Res. , 9, r43-r74 (1981)), to minimize 

potential secondary structure in the resultant mRNA , and 

to provide convenient restriction sites which are useful 

for screening the desired clone and facilitating future 

alternatives in the DNA sequence. The synthetic DNA 

coding for the H uGRF Leu2? was ligated to either the 

Sac II site of the LE gene, replacing the carboxy 

terminal 73 amino acids of the LE protein, or to the 

BssH 11 site, replacing 11 amino acids. 

Although the examples cited interpose a methionine 

between the LE protein and the heterologous polypeptide, 

it will be appreciated that this junction may comprise 

any combination of the twenty naturally occurring amino 

acids, or none at all, and need not be specifically 

cleavable by chemical or enzymatic means, yet still 

remain within the scope of the invention. 

Good induction of expression from the Trp 

promoter-operator of pla.smids encoding the HuGRF L eu 27 

fused to either the BssH II or Sac II sites of the LE 

gene was readily achieved (Figure 5). The kinetics of 

synthesis and the. extent of accumulation of the 

HuGRF Leu2 y were dependent upon the composition of the 

culturing media. Indole acrylic acid (IAA) was required 

to maximally induce synthesis in a relatively rich 

medium, L broth. In contrast, inclusion of IAA in a 

minimal medium, M-9, altered only the kinetics of 

accumulation, but not the final level, of the 

LE-HuGRF, ™ As much as 35% of total protein was 
35 Leu27 
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LE-HuGRF Leu2? after growth in M-9, compared to 15-20% 
in L broth plus IAA, Maximal stability of the plasmids 
is obtained by growth in media containing 20-100 yg/ml 
tryptophan until expression is desired. 

Purification of the HuGRF teu27 was commenced bv 
recovery of the insoluble aggregate of LE ~ HuGRF L e u27 
protein from the lysed cells by low speed centrif ugation 
(Figure 6). This material was treated with CNBr in 70% 
formic acid. Formic acid serves as a preferred solvent 
to dissolve the LE fusion protein aggregates in the 
pellet and thereby provide a homogenous solution for 
further processing. Where the fusion protein itself is 
the desired product, solvents such as SDS or urea may be 
used to dissolve the LE fusion protein aggregates prior 
to further processing. After removal of excess CNBr and 
formic acid, the HuGRF L eu 27 P e P tide was extracted from 
the insoluble material with IN acetic acid. The extract 
was then dialyzed into dilute buffer before fractionation 
by reverse phase HPLC. 

Good resolution of HuGRF Leu27 has been achieved 
using either reverse phase HPLC or a combination of 
CM-cellulose and HPLC (Figures 7 and 8). The latter 
approach was preferred for material isolated from the 
TrpLE-Sac II ~ HuGRF Leu27 clone since CM-cellulose 
removed some contaminants which were not possible to 
separate by HPLC alone. Fractionation of the dialysate 
prepared from the TrpLE*BssH H- H uGRF Leu27 clone 
(Fig. 7B) or the TrpLE-Sac XI - H uGRF Leu27 clone 
(Fig. 7A) on HPLC gave HPLC profiles of total protein 
that were nearly identical. In contrast, when analyzed 
by the sensitive western blotting technique (Fig. 8B), 
the H uGRF Leu27 from the Sac II clone was shown to elute 
from the HPLC with immunoreactive material of higher 
molecular weight. When the dialysate from the Sac II 
cloned HuGRF Leu 27 was first fractionated on 
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CM-cellulose, the HPLC-purif ied material was 
substantially free of these contaminants (Figure 9B). 
HuGRF Leu27 of the greatest purity was isolated from the 
BssH II clone, even without a CM-cellulose step, because 
the immunoreactive contaminants were well separated by 
HPLC (Fig. 8A). The identity of these immunoreactive 
products was based upon their co-migration with the 
immunoreactive products seen with limiting amounts of 
CNBr, their ability to stimulate adenylate cyclase in an 
in vitro GRF assay, and their reactivity with anti-HuGRF 
sera. 

The amino acid sequence of the material isolated by 
preparative HPLC was found to be correct, including the 
substitution of Leu for Met at position 27. Samples with 
a purity of greater than 95% (Figures 9 and 10) were 
shown to stimulate both adenylate cyclase in an in vitro 
assay using bovine pituitary cell extracts and growth 
hormone release from pituitary cells in culture. 

The porcine analog of HuGRF (PGRF) as isolated from 
pig hypothalami differs in amino acid sequence from HuGRF 
at three positions towards the car boxy terminus: Ser-34 
+ Arg; Arg-38 + Gin; Ala-42 > Val (Bohlen et al., 
Biochem. Biophys. Res. Comm. , 116 , 726-734 (1983)). 
Expression of the PGRF Leu27 anal °9 °f PGRF as a fusion 
to the TrpLE protein was therefore achieved by 
specifically deleting the later portion of the 
LE'BssH II-HuGRF Leu27 coding region using restriction 
endonucleases, and substituting in its place a synthetic 
DNA fragment coding for the carboxy terminus of PGRF. As 
in the case of the HuGRP L eU 27 hormone, the 
LE'BssH II-PGRF Leu27 fusion protein was produced in 
high yield from induced cells, quantitatively sedimented, 
and pgrf l 6U 27 easily purified by the identical 
procedure used for the HuGRf= L eu 27* Mater ^ a ^- isolated 
by preparative HPLC was shown to be pure by analysis of 
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the amino acid composition and biologically active by the 
same criteria applied to H uGRF Leu27 above. 

The bovine analog of HuGRF (BGRF), as isolated from 
bovine hypothalami differs from the HuGRF in five 
positions towards the carboxy terminus: Ser-28 + Asn; 
Ser-34 * Arg; Arg-.38 + Gin; Arg-41 ♦ Lys; Ala-42 
♦ Val. In an analogous manner to the construction and 
expression of the PGRF Leu2 7i a clone encoding the 
LE-BssH II-BGRF Leu27 polypeptide was constructed, 
expressed and the BGRF Leu27 successfully purified and 
shown to be pure and biologically active. 

The same procedure was thereby successfully applied 
for expression of the LE^BssH II-PGRF, L eu 27 fusion 
polypeptide and purification of the GRF L eu 27 from three 
different sources, human, porcine and bovine. 

AIDS 

We have described above the construction and 
utilization of an E. coli expression vector to produce, 
in high yield, a number of animal-derived growth hormone 
releasing factors. This vector, designated pTrpLE, uses 
the strong bacterial promoter from the tryptophan 
promoter-operator to express a novel protein known as 
LE. The expression of each growth hormone releasing 
factor was achieved by fusion of the respective growth 
hormone releasing factor gene, in the proper 
translational reading frame, to the carboxy end of the LE 
gene. The LE carrier protein is itself a fusion protein 
between the amino terminus of the TrpE protein and a 
segment of the Trp leader. By linking foreign gene 
sequences in frame and downstream from the LE protein, we 
have successfully employed this and related vectors to 
express a number of heterologous polypeptides in high 
yield. 
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By inserting synthetic BamH I linkers of various 
lengths (e.g., 8, 10 and 12 nucleotides) at the unique 
BssH II site of pTrpLE, we have created a new set of 
expression vectors which allow expression of any 
5 translational open reading frame that is flanked with 
compatible restriction enzyme ends. For instance, 
bacterial or viral DNA can be cleaved (completely or 
partially) with Sau3A (GATC) and ligated to the three 
vectors, which have been previously cleaved with BamH I 
(GGATCC) and treated with alkaline phosphatase. Using 
antibodies directed against the LE carrier or the 
heterologous peptide, the transf ormants producing 
recombinant immunoreactive proteins can be identified. 

To generate an expression library for the AIDS viral 
15 genome, a fragment of DNA derived from pBenn#20 was 

partially cleaved with restriction endonuclease Sau3A and 
inserted into the three open-reading-frame vectors, pSB8, 
pSBlO and pSB12. Plasmid pBenn#20 had been previously 
constructed as a recombinant plasmid containing an 8 Kb 
segment of the proviral DNA, as well as 9 Kb of flanking 

20 

cellular DNA. 

Transf ormants were screened for the presence of AIDS 

viral inserts by colony hybridization with radioactively 

labelled DNA fragments derived from pBenn#20. The 

_ results indicated that greater than 80* of the 
25 

transf ormants contained inserts. Moreover, the 
transf ormants could thereby be divided into subgroups 
based on the region of the AIDS viral genome from which 
the inserts were derived. Selected clones from these 
subgroups were then tested for their ability to produce 

30 

protein molecules that were immunoreactive with anti-AIDS 
virus sera. Protein extracts prepared from induced 
recombinant bacterial cultures were subjected to western 
blot analyses using pooled sera from three patients, each 

35 
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of whom had previously been identified as producing 
anti-AIDS virus antibodies. 

Plasmid DNAs were then prepared from positively 
reactive colonies and their nucleotide sequences 
determined. The location of each insert within the AIDS 
provirus was established by comparing the DNA sequence at 
the boundaries of the insert with the published 
nucleotide sequence of the AIDS virus genome (Table 1). 
The results confirmed that each antibody-reactive clone 
contained an AIDS viral DNA fragment inserted in the 
proper orientation and correct reading frame with regard 
to the transcription and translation of the LE carrier 
protein. Figure 11 summarizes the genomic locations of 
five of the Sau3A recombinant clones, and Figure 12 shows 
their protein profiles as revealed by western blot 
analysis. ■ 

In addition to the genomic library derived from the 
AIDS virus, which was based on the insertion of random 
Sau3A fragments into the three open-reading-frame 
vectors, two additional clones containing specific AIDS 
virus DNA fragments were constructed. (Figure 11) 

The first specific construct utilized an LE-derived 
vector (designated pTrpLE-KAL) in which the nucleotide 
sequence distal to the unique BssH II site was modified 
to contain cleavage sites for both Kpn I and BamH I, as 
25 well as to encode the pentapeptide Asp-Asp-Asp-Asp-Lys. 
This peptidyl linkage can serve as a specific recognition 
site for cleavage by the endopeptidase enterokinase . 

Into pTrpLE-KAL was inserted a 2.15 Kb Kpn I to 
BamH I DNA fragment of AIDS virus DNA from plasmid 
pBenn#20. This fragment spans a major portion of the ENV 
gene. Upon induction the resultant plasmid, known as 
pTrpLE-KAL-ENV-#10, produced a reproducible pattern of 
protein products on SOS gels that were strongly 
immunoreactive on western blots incubated with the 
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anti-AIDS sera (Figure 12). This demonstrated that the 
AIOS viral gene was inserted in the correct translational 
reading frame and downstream from the LE gene. 

A second specific clone, designated pSB-Gag-#25, was 
constructed with a 0.95 Kb Pvu II to Bgl II DNA fragment 
from the AIOS virus GAG gene (Figure 11) inserted in the 
correct reading frame downstream from the LE gene. Upon 
induction, this transformant produced multiple 
polypeptides which were immuno-reactive with anti-AIOS 
sera (Figure 12). 

Western blotting was used with sera from patients 
suffering from AIOS or related diseases to identify viral 
target antigens. The expression vectors described herein 
were used to produce the AIOS proteins which were used as 
antigens in the western blot. An envelope-derived 
protein was most clearly and consistently recognized by 
antibodies from patients with lymphadenopathy , ARC, and 
AIDS (Figure 13). However, a preponderance of the 
members of high risk groups (e.g., homosexuals and 
intravenous drug users) also carried antibodies to the 
ENV protein. 

While antibodies to GAG and POL proteins have been 
reported to be useful indicators of AIDS viral infection, 
our results demonstrate that only a subset of the 
individuals with antibodies to ENV also have antibodies 
to GAG or POL proteins. Conversely, we identified no 
individuals with antibodies to GAG or POL who did not 
also have antibodies to ENV. Thus, the recombinant AIDS 
ENV proteins, produced by the ENV clone 
pTrpLE-KAL-ENV-#10, are more appropriate diagnostic 
reagents for AIDS virus infection than proteins derived 
from other regions of the viral genome, such as GAG or 
POL.. The multiple ENV related protein species produced 
by pTrpLE-KAL-ENV-#10 provide a reproducible recognition 
pattern on western blots to facilitate unequivocal 
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interpretation of immunoassay results. Production of 
these ENV related protein reagents in E. coli is safer 
and more economical than production using AIDS-inf ected 
animal cell cultures. 

The origin and function of the sequence variability 
in the AIDS proviral DNAs, especially prevalent in the 
ENV gene, is not yet known. The virus cloned and 
sequenced herein was originally obtained as an LAV virus 
stock from the group at the Institut Pasteur. Although 
the bacterially produced LE-ENV protein lacks the 
glycosylation normally present and encompasses only a 
portion of the entire ENV open reading frame, antibodies 
from all AIDS and ARCS patients reacted strongly with the 
LE-ENV fusion protein. Thus, the sequence variability of 
AIDS proviral DNA does not seem to present a problem in 
diagnosis using portions of the ENV open reading frame. 
The AIDS ENV polypeptide is thus useful as a diagnostic 
reagent. 

It has now been found that certain fragments of 
LE-ENV fusion proteins retain the immunoreactivity 
exhibited by the intact fusion protein or intact ENV 
protein. Upon comparision between several AIDS strains, 
it was found that these fragments display high (190%) 
sequence conservation. In other words, these fragments 
contain sequences which are identical or nearly identical 
regardless of which strain the sequence was derived 
from. Two especially preferred fragments, a 55 amino 
acid section (Kal-A) and a 91 amino acid section (Kal-B) , 
can be obtained by CNBr digestion of KAL-10. These 
fragments have the following sequences: 
Kal-A ; 

Y Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 
Lys Val Val Lys He Glu Pro Leu Gly Val Ala 
Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg 
Glu Lys Arg Ala Val Gly He Gly Ala Leu Phe 
Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr X 
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Kal-B : 

Z Leu Thr Val Gin Ala Arg Gin Leu Leu Ser 
Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg 
Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg 
He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys 
5 Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala 

Ser Trp Ser Asn Lys Ser Leu Glu Gin lie Trp 
Asn Asn X 

X represents homoserine lactone or its hydrolysis or 
amidation products, which is a result of CNBr cleavage. 

10 Y is Met-Arg or Arg and Z is Met-Thr or Thr. Other 

subsets and analogs of the two sequences set forth above 
which have immunological activity are within the scope of 
the invention. Such subsets may be generated by 
replacing up to three amino acids with other natural or 

15 synthetic amino acids known to those skilled in the art. 
Analogs are generated by replacing up to three amino . 
acids with other natural or synthetic amino acids known 
to those skilled in the art by recombinant or chemical 
methods, such that the resulting polypeptide retains at 

20 least substantially the same immunoreactivity . Analogs 
include subsets and sequences which have up to three 
amino acyl residues within the sequence deleted. 

Additionally, we have discovered that expression of 
a Hind III-BamH i deletion mutant of Kal-10 produces a 

25 truncated ENV polypeptide with excellent 

immunoreactivity. Additionally, the truncated 
polypeptide is expressed in high yield, and may 
constitute about 5% of the total cellular protein in an 
induced cultuxe. Use of this clone simplifies the 

30 purification of immunoreactive polypeptides which are 
ideal for stimulating the production of antibodies for 
use in diagnostic assays. 
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EXAMPLES AND PREPARATIONS 
The descriptions in the following examples and 
preparations will be clear to those skilled in the art of 
molecular biology or biochemistry. More detailed 
descriptions of generally used techniques for nucleic 
acid manipulation such as treatment with DNA 
polynucleotide kinase or DNA ligase, separation by 
electrophoresis, ethanol precipitation, electroelution , 
and other methods may be found in commonly used texts* 
See, for example, "Molecular Cloning, A Laboratory 
Manual" by T. Maniatis, et al., Cold Spring Harbor 
Laboratory (1982). 

The cleavage of DNA by restriction enzyme digestion 
was carried out according to the current recommendations 
of the commercial suppliers. 

PREPARATION 1 

Parental Plasmids 

The plasmid pBR322 is about 4,400 nucleotide bases 
(4.4 Kb) in length and carries genes for resistance to 
20 the antibiotics ampicillin (Amp) and tetracycline (Tet). 
Its derivation and characterization have been described 
(F. Bolivar et al., Gene , 2, 95-113 (1977); Locus pBR322 
GENBANK, DNA Database, Release 34 (1985)). The plasmids 
pUC8 and pUC9 each contain a copy of the Lac promoter and 
25 the amino terminus of the gene coding for the enzyme 
S-galactosidase. Into the amino terminal region of 
this gene is inserted a "cassette 11 comprising a DNA 
sequence recognized by several useful restriction 
enzymes. The plasmid pUC8 and pUC9 differ only in the 
30 orientation of this "cassette" and thereby facilitate the 
manipulation of DNA fragments during cloning. A 
description of the construction and characterization of 
the pUC plasmids has been published (J. Vieira and J. 
Messing, Gene , 19, 259-268 (1982)). 
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The E. coli Trp promoter-operator, the entire 
nucleotide sequence of which has been determined and 
analyzed (C. Yanofsky, et al., Nucleic Acids Res ., 9, 
6647-6668 (1981); Locus ECOTRP, GENBANK, DNA Database, 
Release 34 (1985)), was the source of the Trp promoter 
DNA fragment utilized. The particular plasmid employed 
was pN0342 which contains a 0.45 Kb fragment of E. coli 
Trp promoter-operator DNA from the closest Pvu II site 
upstream of the Trp promoter to the first Hinf I site, 
located in the TrpE gene. To construct pN0342, the 
Hinf I site on the Trp DNA was filled in and both ends 
(the Pvu II end and the filled-in Hinf I) then ligated to 
a Hind III ten base pair-long linker before insertion of 
the modified DNA fragment into the Hind III site of 
pBR322. The Hind III site nearest the EcoR I site 
preceding the Trp promoter of the pBR322 parent was then 
destroyed by cutting with Hind III, filling in the ends 
and religating. The Trp promoter-operator DNA could thus 
be removed from the resultant plasmid by digestion with 
EcoR I and Hind III. This final construct, pN0342, was 
obtained from Dr. Neil Osheroff, Biochemistry Department, 
Stanford University, Palo Alto, California 94305. 

The DNA containing the transcription termination 
sequence utilized in the constructions described herein 
was derived from plasmid pVV202t, which was purchased 
from the laboratory of C. Yanofsky, Department of 
Biological Sciences, Stanford University, Palo Alto, 
California 94305. A 0.37 Kb BamH I fragment from pVV202t 
encompasses 325 base pairs from the transcription 
termination region of the E. coli rpoC gene (C. Squires, 
et al. Nucleic Acids Res. . 9, 6817-6840 (1981); Locus 
ECORPLRP, bases #11226-11551 GENBANK Database, (October, 
1984)). The TrpLE gene was obtained from plasmid pWl, 
which has been previously described (B.P. Nichols, and 
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C, Yanofsky, Methods in Enzymology , 101 , 155-164 (1983)), 
and was purchased from the laboratory of C. Yanofsky. 

Plasmid pBenn#20 DNA was used as the source of AIDS 
viral DNA for the various plasmid constructions. 
Derivation of pBenn#20 is as follows: a T lymphocyte 
culture (A3.01),(T. Folks et al., Proc. Nat. Acad. Scl. 
USA, 82, 4539-43 (1985)) was infected with the AIDS virus 
NA-2 (M. Martin, et al., manuscript in preparation). 
Nine days postinfection, the total cellular DNA was 
isolated from the infected culture. The high molecular 
weight cellular DNA (10 yg) was cleaved with 
restriction enzyme BamH I. 5 yg of the cleaved 
cellular DNA was ligated to 1 ug of BamH I-cleaved 
lambda Jl DNA ( Nature , 308 , pp 856-858, (1984)). The in 
vitro packaged ligated DNA was then used to infect 
E. coli cells. Recombinant phage containing AIDS 
proviral DNA were identified by means of in situ plaque 
hybridization, using a radioactively labelled AIDS viral 
cDNA as the hybridization probe. A positive phage plaque 
was identified. Analysis of DNA purified from the 
recombinant phage particles showed an insert of 17 Kb 
which included 8 Kb of the viral genome (the left long 
terminal repeat sequence, the GAG genes, the polymerase 
gene and three fourths of the envelope gene) as well as 
9 Kb of the cellular flanking sequence. The 17 Kb insert 
was subsequently transferred to plasmid pBR322 via the 
unique BamH I site. The progeny plasmid was designated 
pBenn#20. Plasmid pBenn#20 is publicly available through 
Dr. Malcolm Martin, National Institutes of Health, 
Bethesda, Maryland. 

PREPARATION 2 
Oligonucleotide Synthesis 

The protected deoxyribonucleosides were prepared as 
described by Ti, et al., J. Am. Chem. Soc. , 104, 1316 
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(1982). All oligonucleotides may be synthesized by the 
solid phase phosphoramidite triester approach (M.H. 
Caruthers, et al., in "Gene Amplification and Analysis" 
T. Papas, M. Rosenberg, J. Chirikjian, Eds., Elsevier, 
New York, pp. 1-26 (1983)). Each chain can be built up 
using successive cycles of deprotection and condensation 
using 20 molar equivalents of 5'-0-N protected 
deoxynucleoside- 3" -Q-methyl-N-morpholino 
phosphoramidites (T. Oorper and E.L. Winnaker, Nucleic 
Acids Res. , ll t 2575 (1983)) activated by tetrazole, 
oxidation, and blocking of unreacted sites. After 
completion of synthesis, the methoxy and the N- 
protecting groups are removed (Caruthers (1983) op. cit.) 
and the crude dimethoxy trityl oligonucleotide is purified 
by reversed phase HPLC. After treatment with 80S acetic 
acid the fully deprotected product is further purified by 
electrophoresis on a preparative 20* polyacrylamide gel 
containing 7 M urea. Purified oligonucleotides are 
labelled using [y- P]ATP and polynucleotide kinase. 
The size and purity is then confirmed by electrophoresis 
on a 20* polyacrylamide/7 M urea gel. 

EXAMPLE 1 
Preparation of TrpLE plasmid 

The construction of the plasmid cloning vector 
pTrpLE-#14 was a multi-step procedure (Figures 1 and 2). 
In the first step the plasmid pVV202t, which contains 
both the Trp E and D genes, as well as a portion of the 
TrpC gene, was. treated to create a unique Hind III site. 
After digestion of 1.0 ug of pVV202t DNA with Hind III, 
phenol-CHCl^ extraction and ethanol (EtOH) 
precipitation, 40 ng of the redissolved DNA was 
self-ligated with T4 0NA ligase and the products 
transformed into E. coli HblOl. Ampicillin resistant 
colonies were screened for the presence of a unique 
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Hind III site between the start of the TrpD gene and the 
terminator; pVV-AHind III-#3 had the desired site. 

It was desirable to have a plasmid with a unique 
BssH II restriction site in the TrpE gene for further 
cloning. Plasmid pVV-AHind III-#3 was found to have 
one or more BssH /II restriction sites upstream of the Trp 
promoter in addition to the BssH II site in the TrpE 
gene. Consequently, the approximately 2.5 Kb fragment 
(EcoR 1 to Hpa I) of pVV-AHind III-#3 which contained 
part of the trp promoter, was replaced with a 
functionally analogous 282 bp EcoR I to Hpa I fragment 
from plasmid pN0342. The 0.28 Kb EcoR I to Hpa I 
fragment from pN0342 was isolated by electroelution from 
a 5% polyacrylamide gel after digestion of 5 yg of 
DNA. Then 10 ng of this DNA fragment were ligated with 
80 ng of pVV-AHind III-#3, which had been previously 
cleaved with EcoR I and Hpa I. The products were 
transformed into E. coli HblOl cells and the ampicillin 
resistant colonies were screened by digestion with 
BssH II to identify the desired clone, pTrpE-#22. 

Replacement of the Trp promoter-Leader-E gene region 
of pTrpE-#22 with the analogous portion of pVVl was the 
third and final step. ONA from pVVl (100 ug) was 
digested with BssH II and Hpa I and electroeluted to 
produce fragments of about 6.0, 4.0, 1.8, 0.6, 0.57, and 
0.54 Kb. The 0.57 Kb fragment containing the TrpLE gene 
was recovered by electroelution from the 4% 
polyacrylamide gel used to separate the digestion 
products. After phenol-CHCl 3 extraction and EtOH 
precipitation, 0.2 ug of the redissolved 0.57 Kb ONA 
fragment were ligated with 0.25 vg of the 5.1 Kb DNA 
fragment of pTrpE-#22, obtained by digestion with BssH II 
and Hpa I. The ligation products were digested with 
BstX I, because this recognition site resides in the 
portion of ONA deleted from the Trp promoter-operator in 

89821 25160-FF 



0212532 



10 



15 



20 



25 



35 



-38- 



pVVl. Among the ampicillin resistant colonies, 
pTrpLE-#I4 was identified as having the correct structure. 

EXAMPLE 2 

Preparation of a clone coding for TrpLE«Sst II-HuGRF 

A synthetic gene coding for HuGRF Leu27 was 
designed to maximize the use of amino acid codons which 
are favored in highly expressed E. coli proteins, to 
minimize potential secondary structure in the resultant 
mRNA, to provide convenient restriction sites for 
screening the desired clone and to facilitate future 
alternatives in the DNA sequence. The synthetic gene 
sequence and the eighteen discrete oligodeoxynucleotides 
comprising the synthetic gene sequence are shown in the 
scheme below, and may be referred to in connection with 
the following more particularized discussion. 

To facilitate cloning, the HuGrf l 6u 27 9 ene was 
designed to be cloned in two segments; the A segment, 
comprised of ten oligodeoxynucleotides (Al through A10), 
coding for the amino terminal portion of HuGRF Leu27' 
and the B segment, comprised of eight 
oligodeoxynucleotides, coding for the carboxy terminal 
portion of the peptide, as illustrated in the following 
scheme: 



Met Tyr Ala Asp Ala He Phe Thr 
Sac II Nde I 

Al A3 A5 

(C)CGC GGG ATG GTC CAT ATG TAC GCT GAT GCT ATA TTC ACT 
CC TAC CAG GTA TAC ATG CGA CTA CGA TAT AAG T GA 

30 (BssH II compatible) 

Asn Ser Tyr Arg Lys Val Leu Glu Gin Leu Ser Ala Arg Lys 

A5 A7 A9 Hind III 

AAC TCT TAC CGC AAG GTT CTG GGC CAA TTA TCA GCA CGC A 

TTG AGA ATG GCG TTC CAA GAC CCG GTT AAT AGT CGT GCG TTC GA 

A6 A8 A10 

89821 25160-FF 



0212532 



10 



15 



20 



30 



-39- 

Leu Leu Gin Asp He Leu Ser Arg Gin Gin Gly Glu Ser Asn 

Bl Pst I B 3 Sal I B5 

AG CTT CTG CAG GAC ATI C TT AG T CGA CAA CAG GGC GAA FC F AAC 
A GAC GTC CTG TAA GAA TCA GCT GTT GTC CCG CTT AGA TTG 
ET2 ' ff5 



Gin Glu Arg Gly Ala Arg Ala Arg Leu STOP 

B5 B7 ■ Bel I (BamH I 

CAG GAG CGT GGC GCC AGA GCC AGA CTG T compatible) 
GTC CTC GCA CCG CGG TCT CGG TCT GAC ACTAG 
B6 B8 



To assemble the A segment, each of the ten 
oligodeoxynucleotides (A1-A10) was individually 
phosphorylated. 50 pm of each of the ten phosphorylated 
oligodeoxynucleotides were then combined in a single 
tube. One tenth volume (20 yl) of 3 M sodium acetate 
(NaOAc) solution and 4 volumes (800 yl) of absolute 
EtOH were added with mixing. The solution was chilled 
for 15 minutes at -70°C and then centrifuged at 12,000 x 
G at 4°C for 15 minutes. The supernatant was removed and 
the pellet dried in vacuo . The dried pellet was then 
resuspended in 48 yl of 66 mM Tris-HCl (pH 7.5), 6.6 mM 
MgCl 2 , 10 mM dithiothreitol , and 0.1 mM ATP. To anneal 
the oligonucleotides, the solution was heated to 40°C and 
then allowed to cool slowly over a 4 hour period to 
20°C. 5 units of T4 DNA ligase were then added and the 
solution incubated for 12 hours at 20°C. The annealing 
and ligation reaction was repeated, and the ligated 
mixture phenol extracted and EtOH-precipitated. The 
dried, precipitated DNA was resuspended in 50 yl of 
sterile distilled water, and 35 yl of the ligated 
oligodeoxynucleotides were cleaved with 100 units each of 
Hind III and Sst II. The reaction mixture was incubated 
at 37°C for 4 hours and EtOH-precipitated. The 
Sst II/Hind III-HuGRF-A gene segment was gel purified 
from unreacted and partially ligated DNA fragments by 
preparative electophoresis on a 6.7* polyacrylamide gel. 
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The recovered DNA is referred to herein as the 
Sst II/Hind III-HuGRF-A gene segment*. 

Sixty-two ng of the plasmid pTrpE-#22 DNA which had 
previously been cleaved with BssH II, Hind III and Sst II 
was ligated along with about 6.7 ng (0.14 pm) of the 
Sst II-HuGRF-A gene segment in a final volume of 20 yl 
with 1.0 units T4 DNA ligase in 66 mM Tris-HCl (pH 7.5), 
6.6 mM MgCl 2 , 10 mM dithiothreitol , 0.1 mM ATP, for 16 
hours at 12°C. The ligated DNA mixture was used for 
transformation of E. coli strain HblOl. Transf ormants 
were selected for ampicillin resistance on L broth plates 
containing 50 yg/ml ampicillin. After screening 24 
independently isolated colonies, eight were found to 
contain plasmid DNA with the additional Nde I site from 
the insert as well as the expected Sst II to Hind III 
insert of about 76 bp. Base sequence analysis according 
to Maxam and Gilbert verified that one of these had the 
expected sequence. This plasmid was designated 
pTrpE-Sst H-HuGRF Leu27 -A. 

To assemble the remainder of the HuGRF [_eu27 9 ene » 
six of the eight oligodeoxynucleotides comprising the B 
segment (B2-B7) were individually phosphorylated. 
Phosphorylations were terminated by heating to 70°C for 
10 minutes. Oligodeoxynucleotides Bl and B8 were not 
phosphorylated initially in order to prevent 
self-ligation at the termini. Two hundred picomoles of 
each of the phosphorylated oligodeoxynucleotides (B2-B7) 
were combined with 200 pm each of unphosphorylated Bl and 
B8. The oligomers were precipitated, annealed, and 
ligated as described above. The HuGRF-B gene segment was 
purified from unreacted and partially ligated DNA 
fragments by preparative electrophoresis on a 1.5% 
polyacrylamide gel. The recovered DNA was 
5 1 -phosphorylated. 



89821 



25160-FF 



0212532 



10 



15 



20 



30 



36 



-41- 



For the assembly of the A and B gene segments of 
HuGRF Leu27 , five micrograms of plasmid pTrpE»Sst II- 
HuGRF Leu27 -A was partially digested with 5 units of the 
restriction endonuclease BamH I. The reaction was 
terminated by phenol extraction and the ONA EtOH 
precipitated. The recovered DNA was digested to 
completion with ten units of the restriction endonuclease 
Hind III. The mixture was electrophoresed in a 0.7% 
agarose gel in order to resolve the partial cleavage 
products. The largest (6.216 Kb) fragment was excised 
and electroeluted. The recovered DNA was phenol 
extracted twice and EtOH precipitated. 

Sixty ng of the 6.2 Kb Hind III and BamH I- cleaved 
pTrpE-Sst ii - HuGrf l 6u 27 -A vector DNA was ligated 
along with an estimated five fold molar excess of the 72 
bp HuGRF-B gene segment. The ligated DNA mixture was 
used for transformation of E. coli strain HblOl. 
Transf ormants were selected for ampicillin resistance on 
L broth plates containing 50 yg/ml ampicillin. A 
number of the transf ormants were found to contain plasmid 
DNA with a Sst II to BamH I insert of about 0.5 Kb (the 
insert/terminator junction does not reconstitute a BamH I 
site) . Base sequence analysis according to Maxam and 
Gilbert, Proc. Nat. Acad. Sci. USA , 74, 560 (1977), 
revealed that one of the plasmids, designated 

pTrpE-Sst ii - HuGrf l 6u 27 had the desired HuGRF L e u27 
coding sequence. 

Transfer of HuGRF(Leu27) to pTrpLE-114 

The 0.5 Kb Sst II to BamH I insert comprised of the 
assembled A and B gene segments of HuGRF L eu 27 in 
addition to the transcription terminator, was purified 
from pTrpE*Sst II - HuGRF Leu27 DNA by preparative 
electrophoresis on a 5% polyacrylamide gel. 

60 ng of BssH II, BamH I and Sst II- cleaved 
pTrpLE-#14 plasmid DNA was ligated with about a five fold 
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molar excess of the 0.5 Kb Sst II to BamH I fragment. 

The ligated mixture was used for transformation of 

£• coli strain HblOl. Transf ormants were selected for 

ampicillin resistance on L broth plates containing 

50 yg/ml ,ampicillin. After screening 12 independently 

isolated colonies, 2 were found to contain plasmid DNA 

with the correct Sst II to BamH I fragment. Base 

sequence analysis according to Maxam and Gilbert, Proc. 

Nat. Acad. Sci. U.S.A. , 74, 560 (1977), verified that the 

plasmid pTrpLE-Sst II - HuGRF Leu27 had the desired 
sequence (Figure 3A). 

EXAMPLE 3 

Construction of pTrpLE 'BssH II-HuGRF(Leu27) 

Similarly, one may construct a gene to express 
TrpLE-HuGRF Leu27 using the unique BssH II site rather 
than the Sst II site in the TrpLE gene. 

Eight of the ten oligodeoxynucleotides comprising 
the A segment of HuGRF Leu27 ^ A2 throu 9 h A9 ) were 
individually phosphorylated. The reaction was terminated 
by heating to 70°D for 10 min. Oligodeoxynucleotides Al 
and A10 were not initially phosphorylated to prevent 
self-ligation at the termini. . 200 picomoles (pM) of each 
of the phosphorylated oligodeoxynucleotides (A2-A9) were 
combined with 200 pM of each of Al and A10. The 
oligomers were EtOH precipitated, annealed and ligated. 
The HuGRF-A gene segment was purified from unreacted and 
partially ligated DNA fragments by preparative 
electrophoresis on a 7.5% polyacrylamide gel. The 5' 
ends of the recovered HuGRF-A segment are then 
phosphorylated and ligated to plasmid pTrpE-#22, 
previously cleaved with BssH II and Hind III. The 
ligation mixture was used to transform E. coli HblOl. 
Transf ormants were screened for the presence of an 
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additional Nde I restriction site. The desired clone was 
designated pTrpE'BssH II-HuGRF-A. 

To complete the H uGRF Leu27 9 ene sequence, the 
HuGRF-B gene segment as well as the transcriptional 
terminator were utilized from pTrpE»Sst II- 
HuGRF Leu2? . Plasmid pTrpE*Sst "-HuGRF^^ was 
cleaved at the unique BamH I and Hind III sites. The 
0.4 Kb Hind III to BamH I fragment was purified by gel 
electrophoresis- Plasmid pTrpE*BssH II-HuGRF-A was 
cleaved with BamH I and Hind III and the largest fragment 
purified from a Q.7X agarose gel after electrophoresis. 
These two purified Hind III to BamH I fragments were 
ligated and transformed into E. coli HblOl. Plasmid DNA 
from the transf ormants was screened for the presence of 3 
Pst I sites (one Pst I site is conferred by the 
insert). By this technique plasmid pTrpE # BssH II- 

HuGRF Leu27 was identifi ed. Base sequence analysis 
(Maxam and Gilbert) revealed an unexpected single base 
pair deletion at the start of the BssH II-HuGRF-A gene 

segment. Thus, the sequence GCG CGA TG was found 

rather than the expected sequence GCG CGG ATG — . This 
resulted in the alteration of the triplet reading frame 
for the remainder of the HuGRF L eu 27 9 ene > which was 
repaired in the next step. 

The replacement of the TrpE DNA segment of plasmid 
pTrpE^BssH II-HuGRF Leu27 by the TrpLE DNA segment was 
achieved by joining portions of two previously described 
plasmids via a synthetic oligonucleotide. This 
oligonucleotide was designed to replace the HuGRF-A DNA 
sequence between the BssH II and Nde I sites in order to 
restore the proper translational reading frame for 
HuGRF Leu27 synthesis. To provide the heterologous 
HuGRF Leu27 codin 9 9 ene f I 2 - 5 ug of pTrpE'BssH II- 

HuGRF, _ „ DNA was cleaved with Nde I and BamH I and 
Leu27 

the desired 0.5 Kb fragment isolated from an agarose 
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gel. Next, the self-complimentary synthetic 
oligonucleotide TAGCGCGC was treated with T4 kinase, and 
300 ng was ligated to 0.5 ng of the isolated fragment. 
This reaction mixture was next heated to 60°C for 
15 minutes to inactivate the kinase, and then cleaved 
with BssH II to convert the Nde I end of the 0.5 Kb 
Nde I-BamH I fragment into a BssH II site, as illustrated 
below: 

Nde I BamH I 

GCG CG(G) ATG GTC CAT ATG TAC GGATCC— 

CGC GC(C) TAC CAG GTA TAC ATG CCTAGG— 

l«- 0.5 Kb + 1 

+ BamH I 
+ Nde I 
+ 

15 TATGTAC 'G 

ACATG CCTAG 

TAGCGCGC 
CGCGCGAT 

+ T4 Kinase 

20 + < — - 

TAGCGCGCTATGTAC G 

CGCGCGATACATG CCTAG 

+ BssH II 

CGCGCTATGTAC G 

25 GATACATG CCTAG 

The TrpLE coding gene was provided by digestion of 
plasmid pTrpLE-#14 with BssH II and BamH I. A ligation 
between 100 ng of the 0.5 Kb BssH II-BamH I fragment and 
100 ng of the double digested pTrpLE-#14 DNA was 
performed. The ligation products were transformed into 
competent E. coli HblOl cells and plated onto ampicillin 
plates. Restriction enzyme screening of DNA from the 
resulting colonies by digestion with BssH II plus BamH I 
or with Pst I identified 7 out of 24 candidates as having 

35 
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the anticipated structure. The DNA sequence of plasmid 
pTrpLE»BssH U-HuGRF Leu27 -#5 was confirmed by DNA 
sequence analysis (Figure 3B) . 

EXAMPLE 4 

Expression and Purification of LE-HuGRF(Leu27) 

The procedure described herein can be used to purify 
HuGRF from the LE-HPGRF Leu27 clones which utilize 
either the BssH II or Sac II fusion site to the TrpLE * 
protein. Before beginning, the extent of induction can 
be monitored by analysis of an aliquot of the washed cell 
pellet on a stained SDS gel. There is no difference in 
the level of expression or the purification steps up to 
the point of dialysis with either clone (Figure 4). 
During dialysis, the BssH II clone deposits a large 
precipitate, whereas the Sac II clone forms a smaller 
precipitate. Centrif ugation of the BssH II dialysate 
thus leaves a larger pellet which must be thoroughly 
washed to fully recover the LE-HPGRF Leu27 . 
Induction of TrpLE-HuGRF 

In this case, a starter culture of 50 ml of LBr 
culture media (reference to the preparation of the LBr 
and M9 media described here can be found in J. H. Miller 
(1972) "Experiments in Molecular Genetics," p. 431) 
containing 50 pg/ml of ampicillin and 100 pg/ml of 
tryptophan was inoculated with the E. coli cells 
containing the plasmid, i.e., pTrpLE •BssH II- 
HuGRF Leu2? , and grown overnight at 37°C with vigorous 
shaking to a final A 55Q of about 6. Four liters of M9 
culture media for production were pre-warmed to 37°C and 
seeded with 40 ml of the LBr starter culture to give a 
final A 55Q of about 0.6. The culture is then grown 
with vigorous shaking to an A 55Q of about 0.2 to 0.5, 
whereupon 4. ml of a 10 mg/ml solution of indole acrylic 
acid (IAA) in EtOH is added to give a final concentration 
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of 10 ug/ml solution of IAA. Growth is then continued 
with good aeration for about 16 hr to a final A 55Q of 
about 5 (typically, about 5 to 10), The addition of IAA 
during growth in M9 media is optional since it increases 
only the kinetics of product accumulation and not the 
final yield. An alternate and equally effective method 
of cell growth is to substitute LBr for M9, in which case 
the addition of IAA is essential for adequate product 
yields. The cells were concentrated by centrif ugation, 
resuspended in 500 ml (typically 1/10 to 1/20 of the 
original culture volume) of 50 mM Tris-HCl, pH 7.1, and 
re-pelleted. The washed cells may be conveniently stored 
at -20°C at this stage. 
Sonlcatlon 

The washed cells are lysed by sonic disruption. 
Cells were resuspended in 900 ml (typically 20-100 A 55Q 
units per ml) of 50 mM Tris-HCl, pH 7.5, 0.1 mM EDTA, 
1 mM phenylmethylsulfonyl fluoride (PMSF) (this buffer 
solution will be abbreviated "TE-PMSF" ) , cooled to 4°C, a 
few drops of octanol added and N 2 gas bubbled through 
the solution. These steps serve to minimize heating of 
the sample and oxidation of methionine residues during 
sonication. The suspension was sonicated using a model 
W220F Heat Systems-Ultrasonics, Inc. sonicator (equipped 
with a 3/4" horn) according to the manufacturer's 
recommendations. The sonicator was typically operated 
only 50% of the total process time to avoid overheating. 
Centrif ugation 

The cell lysate was centrifuged for 15 min at about 
3600 x g max to pellet the pTrpLE-HuGRF Leu27 and the 
supernatant discarded. The pellet was resuspended in 
450 ml TE-PMSF (typically 40-80 A 55CJ /ml) and the pellet 
collected by centrif ugation as before. 
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EXAMPLE 5 
Purification of HuGRF(Leu27) 

Cleavage of the methionine residues in the 
TrpLE-HuGRF Leu27 fusion protein with CNBr causes 
release of the desired heterologous H uGRF Leu27 protein, 
which can be purified as described below. 
CNBr Treatment of Pellet 

The washed pellet of TrpLE-HuGRF Leu2? was 
resuspended by gently stirring or swirling it in 60 ml 
70% formic acid (about 20 mg/ml total protein; typically 
material from 1000 A 55Q units of cells is dissolved in 
3 ml). A few drops of octanol were added and N 2 
bubbled through the solution for 20 minutes before adding 
5.5 g CNBr. This reaction was allowed to proceed for 
6 hours at 25°C before the addition of an equal volume of 
50:50 MeOH:H 2 0 in order to facilitate the subsequent 
concentration step. The volatile excess CNBr, water and 
organic solvents were removed by rotary evaporation until 
the volume was reduced by one-half. Additional 50:50 
Me0H:H 2 0 was mixed with the sample and subsequently 
removed by rotary evaporation. After 2-4 repetitions of 
this process, the bulk of the formic acid and CNBr had 
been removed. The sample was then evaporated to dryness, 
redissolved in 200 ml of water and lyophilized. 
Acetic Acid Extraction 

The lyophilized sample was dissolved in 80 ml IN 
acetic acid, and extracted for at least 30 minutes at 
25°C. After centrifugation for 10 minutes at 12,000 x 
9 max the supernatant was recovered and the pellet 
reextracted with 70 ml 1 N acetic acid as before. The 
pellet fraction was discarded and the supernatants, which 
contain the H ^ GRF L eu 27> were combined. 
Dialysis 

The IN HOAc extract was next placed into dialysis 
tubing having a molecular weight cutoff of 3500 and 
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dialyzed for a total of 18 h against four changes of 
10 mM NaOAc, pH 5,0, 4 L each. The first half of the 
dialysis was performed at 25°C, and the second half at 
4°C. The precipitate which formed was removed by 
centrif ugation for 10 min- at 10,000 x g max - If 
5 warranted, the pellet was washed with buffer to maximize 
recovery of HuGRF in the soluble fraction. 
CM-Cellulose Chromatography 

The clarified dialysate was then passed through a 
50 ml bed volume CM-cellulose column (2.5x10 cm) at 

10 1.5 ml/min in a cold room (Q-4°C). After washing with 
50 mM NaOAc, pH 5, a gradient from 50 to 500 mM NaOAc, 
pH 5 was used to elute the HuGRF Leu27 # The P eak °f 
HuGRF Leu27 could be determined by immunoblotting 
(western analysis) of the column fractions. 

15 HPLC 

Pooled fractions from the CM-cellulose column were 
filtered through a milex-GV 0.22 pm filter (Milipore) 
and applied at 4 ml/min directly onto an 0.8x25 cm HPLC 
column packed with Vydec C18 TP201 N-capped (10 ym) 
20 300 A pore beads which had been equilibrated in solvent 
A (0.1% trif luoroacetic acid in water). The column was 
then washed with 75:25 solvent A:solvent B (0.1% 
trif luoroacetic acid in acetonitrile) until the baseline 

returned to zero. The HuGRF L e u27 was then eluted with 
25 a gradient running from 25 to 35% solvent B over a period 
of 30 min. The major peak in the chromatogram, which 
contained the HuGRF, eluted after about 26 minutes and 
was collected. The solvent was removed by 3-4 cycles of 
rotary evaporation from water. 
30 Gel Filtration 

As a final purification step, and in order to 
replace any remaining traces of CF 3 C0 2 - with C1-, the 
dried sample was dissolved in 4 ml of 25 mM HC1 and 
applied to a 25 ml column (2.5x5cm) of Sephadex G-50 
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which had been equilibrated in the solvent. The column 
effluent was monitored by absorbance at 280 nm and the 
void volume peak containing the HuGRF was pooled. This 
material was lyophilized several times from water until 
the pH of the solution exceeded 5. A white, fluffy 
material was obtained. 

EXAMPLE 6 

Characterization And Biological Activity of HuGRF(Leu27) 
A purity of greater than 95% for the final 

HuGRF Leu27 P roduct was demonstrated by SOS gels (Burr 
and Burr, Meth. Enz. . 96, 239-244 (1983)) either stained 
for total protein or visualized by western blotting with 
anti-GRF antisera (Figure 9). In addition, an HPLC 
profile was obtained (Figure 10). The measured dry 
weight of the final product and the theoretical 'weight 
calculated from the total amino acid content agreed to 
within 5%. Further, the amino acid composition and 
sequence agreed with the predicted results. 

The 44-amino acid peptide GRF induces secretion of 
growth hormone from the anterior pituitary gland. The 
mechanism of action involves a receptor-mediated 
activation of adenylate cyclase and elevation of 
intracellular cyclic AMP (Labrie, F., 8. Gagne, and G. 
Lefevere, Life Sciences . 33, 2229-2223 (1983)). The 
biological potency of the HuGRF L eu 27 was therefore 
demonstrated by its ability to activate adenylate 
cyclase, by its specific binding to pituitary cells, and 
by its ability to stimulate the release of GRF from 
pituitary cells. 

Adenylate Cyclase Enzyme Activation Assay 

To assay adenylate cyclase activation by GRF we 
chose bovine pituitary glands. Pituitary membranes were 
lyophilized and enzyme activity was assayed in accordance 
with established procedures (Alvarez, R. f and J.J. Bruno, 
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Proc. Nat. Acad. Sci. USA , 74, 92-95 (1977)). The 
HuGRF Leu27 had an EC 5Q of 2 uM. The shape of the 
response curve and the molar potency were similar to that 
of authentic HuGRF-NH^. We concluded that the peptide 
produced by recombinant DNA techniques has definite 
5 biochemical activity. 
Binding Assay 

The specific binding of GRF to pituitary cells was 
determined as follows: 

GH3 rat pituitary tumor cells, a somatotrophic cell 
10 line which was obtained from Dr. D. Gospodarowicz 

(University of California, San Francisco), were grown in 
Dulbecco's minimal essential medium (DMEM) . Synthetic 
HuGRF-NH 0 was purchased from Peninsula Labs (San 



15 
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30 



125 

Carlos, CA) and radioactively labeled with Na[ I] 
using Enzymobeads (BioRad) to a specific activity of 
20,000 cpm/ng protein. Cells were grown to confluence in 
24-well Costar trays washed with DMEM buffered at pH 7.4 
with 20 mM HEPES and containing 0.135 bovine serum albumin 
(DMEM-BSA) . Cells were incubated at 4°C for one hour 
with 2 x 10" 9 M [ 125 I]-HuGRF(l-44) in either the 
absence or presence of increasing concentrations of 
recombinant GRF analogs in a final volume of 250 ul 
DMEM-BSA. Cells were washed three times with DMEM-BSA, 
lysed with 0.1 N NaOH and the cell-associated 
radioactivity was determined in a gamma counter. 
Nonspecific binding was assessed in the presence of 
10" 7 M synthetic HuGRF-NH^ and did not exceed 15* of 
the specific binding. 

In independent experiments it was established that 
GH3 cells possess specific receptors for HuGRF-NH 2 . 
Binding was saturable and reversible. Scatchard analysis 
of this data revealed about 20,000 binding sites per cell 

with an apparent K . for hpGRF-(l-44) of about 

9-1 
1.0 x 10* M . 
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The recombinant human and porcine GRF were almost 
equipotent with synthetic human GRF (2 to 3-fold reduced 
affinity) for their receptor binding ability. 
Growth Hormone Secretion Assay 

The ability of GRF to stimulate the release of 
growth hormone from pituitary cells was also assayed. 
Normal rat anterior pituitary cells were established in 
primary cultures as follows: Male adult Sprague-Dawiey 
rats (250 g) were sacrificed by decapitation. The 
anterior pituitary was dissected, placed in DMEM 
containing 1% BSA and minced into 1 mm pieces. After 
several washes with DMEM-BSA, the tissue was incubated at 
37°C for 35 minutes with 0.2 U trypsin/ml and 1 mg/ml 
ONAase/ml with gentle shaking. Tissue fragments were 
15 then further dispersed by physical shearing with a 

Pasteur pipette. Large fragments were allowed to settle 
and were removed. Cells in the supernatant were then 
pelleted, washed and plated for tissue culture for A days 
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30 



in DMEM + 10% FCS before an experiment. The average 
yield of viable cells was approximately 2 x 10 6 cells 
per pituitary. Cells were plated in 24-well dishes and 
incubated with GRF or analogs for 3 hours at 37°C. 

The concentration of GH in the medium was determined 
by a solid phase RIA using reagents from the NIH 
Pituitary Agency. Ninety-six well poly viny lchloride 
plates were coated overnight at 4°C with rabbit 
anti-monkey ig antibodies (Pel-Freez) at 0.1 mg/ml in 
phosphate buffered saline (PBS). Plates were 
countercoated with a 3% BSA solution in PBS for at least 
one hour at 37°C. Plates were washed and incubated with 
monkey anti-rat GH antiserum (1/10,000 final dilution) 
for 2 hours or more at 37°C. After several washes with 
PBS-0.1% BSA, plates were incubated at 37°C for 2 hours 
with 20 ng/ml [ 125 I]-GH in the presence or absence of 
unlabeled GH or dilutions of cellular conditioned 
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medium. After several washes, plates were dried and 
individual wells were cut out and counted in a gamma 
counter. 

The recombinant human and porcine GRF appeared 
equipotent to and as efficacious as synthetic human GRF 
5 for the induction of GH release. 

EXAMPLE 7 

Construction And Use Of pTrpLE-PGRF(Leu27) 

Using an approach and methods similar to that 

10 described in Examples 2-3, plasmids were constructed in 
which either the unique BssH II site or the unique Sst II 
site of the pTrpLE-#14 vector was utilized to encode for 
a fusion protein between the TrpLE protein and the 
PGRF Leu27 ana l°9 ue °f PGRF. This was accomplished by 

15 replacing that portion of the synthetic DNA encoding 

HuGRF located between the Sal I site within the HuGRF DNA 
segment and the BamH I site at the proximal end of the 
terminator DNA segment by synthetic DNA encoding PGRF. 
The replacement of the HuGRF synthetic DNA by the PGRF 

20 synthetic DNA was facilitated by first constructing 
plasmid pPS#8 in which the Sal I site within the HuGRF 
gene and the BamH I site just beyond its end were both 
unique. 

Construction of vector plasmid pPS#8 

25 The BamH I site in plasmid pTrpE-#22 which lies the 

farthest from the TrpLE gene was first removed by 
digesting the plasmid with BamH I and Sal I, EtOH 
precipitating the DNA, and treating the mixture with T4 
ligase. (BamH I cuts at each end of the terminator DNA 

30 segment, and Sal I cuts within the pBR322 DNA region and 
within the terminator, nearest the Hind III site.) The 
ligation products were treated with Sph I (which cuts 
within the pBR322 DNA region) and transformed into 
competent HblOl cells. One of the resulting plasmids, 
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pVVAHinASal#15, was shown to contain the 0.33 Kb 
BamH I to Sal I fragment from the transcriptional 
terminator region of pTrpE-#22 flanked by unique BamH I 
and Sal I sites. This fragment was in the opposite 
orientation in pVVAHinASal#15 compared to pTrpE-#22, 
but it functions effectively in both orientations. 

The Sal I site in pWAHinASal#15 was next 
removed by deleting the DNA between the Sal I and the 
Pvu II sites of the pBR322 portion of the vector. This 
was accomplished by digesting pVVAHinASal#15 with 
Sal I and Pvu II and filling in the Sal I end by 
incubating with DNA polymerase I Klenow fragment. The 
resulting DNA was then cleaved with Sal I and transformed 
into HblOl cells. The resulting plasmid, pPS#3, 
contained the expected deletion and lacked a Sal I site. 
Since the procedure described would be expected 'to 
regenerate a Sal I site, the lack of a Sal I site in 
pPS#3 probably reflects the increased transformation 
efficiency of circular ONA lacking a Sal I site, as 
compared with Sal I-cleaved linear DNA. 

Next, the 1.35 Kb EcoR I to BamH I fragment encoding 
TrpLE-HuGRF Leu2? was excised from pTrpLE«BssH II- 
HuGRF#5 and ligated with EcoR I and BamH I cleaved pPS#3 
DNA. The ligation mixture was treated with BstX I, which 
cleaves only the pPS#3 ONA fragment not desired in the 
final construct, and transformed into competent HblOl 
cells. The resulting plasmid, pPS#8 f containing a gene 
encoding the TrpLE-HuGRF protein, with its Sal I site, 
followed by a transcriptional terminator DNA segment, 
also containing a Sal I site, followed by a unique BamH I 
site, followed by a transcriptional terminator DNA 
segment lacking a Sal I site. 
Insertion Of Synthetic DNA Encoding PGRF 

The three amino acid substitutions which distinguish 
PGRF from HuGRF all lie beyond the Sal I site in the 
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HuGRF Leu27 gene sequence of pPS#8. Since the sequence 
beyond the BamH I site of pPS#8 contained a functional 
transcriptional terminator, synthetic DNA encoding PGRF 
was designed to replace the DNA between these two sites 
in pPS#8. Six oligonucleotides were synthesized, 
ligated, cleaved with Sal I and BamH I, and the 54 bp 
oligonucleotide illustrated below isolated from a 5% 
polyacrylamide gel: 

10 Arg Gin Gin Gly Glu Arg Asn Gin Glu Gin Gly Ala Arg 

CI C3 C5 

— I UGA CAA CAG UGU GUC CGT AAC CAG GAG CAG 5GTC GCC AGG 
(Sal I) GTT GTC CCG CGG GCA TTG GTC CTC GTC CCG CGG TCC 
Z2 TT5 



1S Val Arg Leu STOP 
C5 Xba I 

GTG CGG CTC TAG AG (BamH I) 
CA C GCC GAG ATC TCC TAG 

This 54 bp fragment was ligated to pPS#8 which had 
20 been previously cleaved with Sal I and BamH I and the 
resultant clones screened for the presence of the unique 
Xba I site which was designed into the synthetic PGRF 
DNA. Plasmid pTrpLE # BssH II-PGRF#10 was thus 
identified, followed by experimental verification of the 
25 DNA sequence. 

By using a similar approach with a vector analogous 
to pPS#8, but containing a HuGRF gene fused to the Sac II 
site of TrpLE, the plasmid pTrpLE'Sac II-PGRF was 
constructed. 

3Q Purification and Biological Activity of PGRF(Leu27) 
Using the procedures described in Examples 4-6, 
E. coli cells (strain W3110 prototroph, ATCC No. 27325) 
containing plasmids encoding the T *pLE-PGRF Leu27 fusion 
protein were grown, expression of the fusion protein 

35 induced, and the PGRF Leu 27 released by CNBr treatment 
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and purified. The purity and biological activity of the 
final product were comparable to that described for 

HuGRF Leu27' 

EXAMPLE 8 

Construction and Use of a pTrpLE-BGRF (Leu27) Vector 

Following the approach outlined above, the carboxy 
terminal encoding region of the HuGRF portion of plasmid 
pPS08 was altered to encode the BGRF Leu27 analog of 
BGRF. Since the five amino acid differences which 
distinguish BGRF from HuGrf l 6U 27 a11 lie be y° nd the 
Hind III site in the HuGRF Leu27 codin 9 region of 
plasmid pPS#8, eight oligonucleotides were designed to 
encode the BGRF amino acid sequence and replace the ONA 
fragment between Hind III and BamH I. After these 
oligonucleotides were synthesized, phosphorylated, ligated and cloned 
into pUC8 f cleavage with Hind III and BamH I produced the 
following 75 bp DNA fragment: 



Leu Leu Gin Asp lie Leu Asn Arg Gin Gin Gly Glu Arg 

Dl D3 05 

5' AG CTT CTG CAG GAC ATT CTT AAC CGA CAA CAG GGC GAA UGT 
3' A GAC GTC CTG TAA GAA TTG GCT GTT GTC CCG CTT GCA 

Hind III 02 04 



25 Asn Gin Glu Gin Gly Ala Lys Val Arg Leu STOP 

05 07 BamH I 

AAC CAG GAG CAG GGC GCC AAG GTG CGG CTC TAG AG 

TTG^ GTC CTC GTC CCG CGG TTC CA C GCC GAG ATC TCC TAG 
015 " D"8 

This fragment was next ligated into Hind III and BamH I 
30 cleaved plasmid pPS#8 to produce the desired plasmid, 

pTrpLE-BssH H-BGRF Leu27 #15. 

The procedures described in Examples 4-6 were used 

to grow E. coli cells (strain W3110 prototroph, ATCC No. 

27325) containing plasmid pTrpLE # BssH II- 
35 BGRF, 27 #15, express the fusion protein, purify the 
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BGRF Leu27 released by CNBr cleavage and test the 
biological activity. Results demonstrated activity 
comparable to HuGRF L e u27 and PGRF Leu27" 

EXAMPLE 9 

5 Formulations 

HuGRF Leu27' BGRF Leu27' « PGRF Leu27 ls 
dissolved or suspended in physiological saline at. a 

concentration of about 1 to 5 yg/ml. Other 

pharmaceutical^ acceptable excipients may be added if 

10 desired. Two ml of the resulting solution is 

administered by intramuscular or subcutaneous injection 
one to four times per day. A typical mammalian dose is 
about 0.1 to 0.5 pg per Kg body weight per day. 
Alternatively, H uGRF Leu2?> BGRF Leu2? , or 

1 5 PGRF Leu27 ma y be disso l v ed or suspended in 

physiological saline and the solution or suspension 
installed in a controlled release device for continuous 
administration. A typical continuous release rate is 
from about 1 to about 5 ug per hour per Kg body weight. 

20 

EXAMPLE 10 

Construction of 3 open-reading-frame vectors 

In order to construct a set of three vectors 
suitable for expression of any open reading frame as a 

25 fusion to the TrpLE protein, oligonucleotides encoding 
the BamH I recognition sequence GGATCC were inserted near 
the unique BssH II site of the TrpLE gene in each of the 
three possible translational reading frames relative to 
the TrpLE coding gene. 

30 To provide a substrate for the ligation of these 

oligonucleotides, plasmid pPS#8 ONA (2 ug) (See 
Example 7) was cleaved at 50°C for one hour with 4 units 
of the restriction endonuclease BssH II. The reaction 
was terminated by extracting with phenol and the DNA EtOH 
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precipitated. To fill the BssH II-generated 5 f 
overhangs, the ONA was resuspended in 20 yl of buffer 
containing 40 mM KHPO^ (pH 7.5), 6.6 mM MgCl 2 , 1.0 mM 
2-mercaptoethanol, 250 uM of each of the four 
deoxynucleotides. 1 unit of Klenow fragment (E. coli DNA 
* polymerase large fragment) was added and the mixture 
incubated at room temperature for 15 minutes. 

To construct the three reading frame vectors, the 
flush-ended BssH II-cleaved pPS#8 DNA was divided into 
three reactions. Each of the reactions contained 

10 0.5 yg of ONA in ligation buffer [50 mM Tris-HCl 

(pH 7.8), 10 mM MgCl 2 , 20 mM dithiothreitol], 100 uM 
ATP, 2 units of T4 ligase and a 100-fold excess of one of 
the three synthetic BamH I linkers (CGGATCCG , CCGGATCCGG 
or CCCGGATCCGGG) . After overnight incubation at 12°C, 

15 the reaction mixtures were adjusted to 150 mM NaCl, 6 mM 
Tris-HCl (pH 7.9), and 6mM MgCl 2 and treated with 1 
unit of BamH I. After 1 hour's incubation at 37°C the 
reactions were terminated by phenol extraction. ONA was 
recovered from each reaction by EtOH precipitation. Each 

20 

precipitated ONA was resuspended in 10 \xl of ligation 
buffer and treated with 1 unit of T4 ligase. The 
ligation mixtures were then used to transform competent 
E. coli HblOl cells. Progeny clones, in which a BamH I 
site lies between the BssH II site of TrpLE (6CGCGC) and 
25 the terminator DNA segment, were designated as pSB8, 
pSBlO and pSB12. Nucleotide sequencing verified that 
these three vectors contain the appropriate BamH I 
linkers, according to the following scheme: 

30 

BamH I 

pSB8 CTG CGC GCG GAT CCG TCG 

Leu Arg Ala Asp Pro Ser 
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pSBlO 



BamH I 

-CTG CGC GCC GGA TCC GTC 
-Leu Arg Ala Gly Ser Val 



BamH I 



pSB12 



-CTG CGC GCC CGG ATC CGT 
-Leu Arg Ala Arg lie Arg 



5 



EXAMPLE 11 



Construction of Genomic Library for AIDS 

The construction of a library of clones capable of 
expressing open reading frames of the AIDS virus fused to 

10 the TrpLE protein was achieved by ligation of a partial 
Sau3A digest of AIDS DNA into the BamH I site of the 
three open reading frame vectors, pSB8, pSBlO, and 
pSB12. A partial Sau3A digest of AIDS DNA was used to 
increase the chances that large open reading frames, 

15 which may be interrupted by one or rftore Sau3A sites, will 
be represented intact. 

Plasjnid pBenn#20 (10 yg) was aoubly digested with 
restriction enzymes BamH I and Sst I (10 units each) 
using the buffer conditions recommended for BamH I at 

20 370c for 90 minutes. The resulting DNA fragments were 
resolved on an 0.7% preparative agarose gel. An 8 Kb 
fragment which encompassed four fifths of the AIDS viral, 
genome was eluted from the agarose gel and concentrated 
by EtOH precipitation. The resuspended DNA (in buffer 

25 containing 50 mM NaCl, 6 mM Tris-HCl (pH 7.5), and 5 mM 
MgCl 2 ) was divided into three reactions, each of which 
was digested with 0.1 or 0.01 units of Sau3A at room 
temperature for 15 min. After phenol extraction to 
terminate the reactions, the extent of Sau3A digestion 

30 was monitored by analyzing the cleavage products on an 
agarose gel. The DNA sample treated with 0.1 unit of 
Sau3A showed the appropriate extent of partial digestion 
and was chosen for subsequent ligation to the three 
open-reading frame vectors. Approximately 200 ng of the 
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partially Sau3A-digested DNA was mixed with 100 ng each 

of the three vectors that had been previously treated 

with BamH I and alkaline phosphatase in 30 pi of 

ligation buffer. One unit of T4 ligase was added to each 

g of the reactions and the mixtures were incubated for 6 

hours at 12°C. The ligated DNA from each of the three 

ligation reactions was then separately transformed into 

competent E. coli HblOl cells. 

A total of approximately 400 clones were obtained 

mmk from the initial transformation. Greater than 80% of 
10 

these clones contained AIDS viral DNA inserts as 
evidenced by their positive hybridization to fragments of 
pBenn#20. Progeny clones were further screened for their 
ability to produce novel protein molecules that were 
immunoreactive with anti-AlDS virus sera obtained from 
A10S patients (Figure 12). Nucleotide sequencing was 
performed on several clones to determine the sequences at 
each vector-insert junction. The results (Table 1, 
Figure 11) verified that the positive clones contained 
AIDS viral fragments inserted in the proper orientation 

20 

and correct reading frames with regard to the 
transcription and translation of the LE carrier protein. 
Moreover, the origins. and boundaries of the inserts 
within the AIDS viral genome were defined, using 
published AIDS viral DNA sequences as a reference. (S. 
26 Wain-Hobson, et al., Cell , 40, 9-17 (1985); R. 

Sanchez-Pescador, et al., Science , 227 , 484-492 (1985); 
L. Ratner, et al., Nature , 313 , 277-284 (1985); 
M.Muesing, et al.. Nature , 313 , 450-458 (1985)). 

30 

EXAMPLE 12 

Construction of other specific AIDS viral clones 

In addition to the AIDS genomic library described 
above, which was based on the insertion of random Sau3A 
3g fragments into the three open-reading-frames vectors, two 
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additional clones containing specific DNA fragments were 
constructed (Table 1, Figure 11). 

The first specific construct utilized a pTrpLE- 
derived vector (designated pTrpLE-KAL) . pTrpLE-KAL was con- 
structed by cleaving pTrpLE-#l4 with BssH II and BamH I, 
5 followed by the introduction of oligonucleotides to read as 
follows: 

(former) 

BssH II Kpn I BamH I 

10 CTG CGC GAT GAT GAT GAT AAG GTA CCG GAT CCG 
Asp Asp Asp Asp Lys 

The introduced adapter contains the cleavage sites for 
both Kpn I and BamH I as well as coding fcr t-he 

15 pentapeptide Asp-Asp-Asp-Asp-Lys , a peptidyl linkage 

specifically recognized by the endopeptidase enterokinase . 

Using standard laboratory procedures, a 2.15 Kb 
Kpn I to BamH I fragment of plasmid pBenn02O, a fragment 
which spans the major portion of the ENV gene, was 

20 transferred onto plasmid pTrpLE-KAL via the same two 
restriction sites. The resultant plasmid, designated 
pTrpLE-KAL-ENV-#10, upon induction produced a 
reproducible pattern of protein products on SOS gels that 
were strongly reactive with the anti-AIDS sera 

25 (Figure 12). 

The other specific clone was constructed so as to 
contain a 950 base pair fragment between the Pvu II and 
the Bgl II sites within the GAG gene of the AIDS virus. 
To construct this clone, a 1.1 Kb Sau3A fragment which 

30 encompasses the coding genes for the central region of 
the GAG open-reading frame was purified and cloned into 
the unique BamH I site of pSB8. We chose to use this 
particular vector in order to insert the Sau3A fragment 
in the "wrong" translational reading frame in reference 
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to the LE protein. Our previous experience showed that 
attempts to insert this fragment in the "correct" reading 
frame would not result in proper translation. By using 
the "wrong" reading frame of pSB, many transf ormants 
containing the desired insertion were obtained. These 
were designated pSBB-GAGl.l. To regenerate the correct 
translational reading frame, pSB8-GAGl.l DNA was cleaved 
with Sst II (within the LE) and Pvu II (about 130 bp into 
the Sau3A fragment). The 3' overhangs of the Sst II site 
were then removed using DNA polymerase I as described 
above, and the two blunt ends were recircularized using 
T4 ligase. Upon induction, one of the progeny 
transf ormants, pS8-Gag-#25, produced multiple 
polypeptides that were immuno-reactive with the anti-AIDS 
sera (Figure 12) . 

EXAMPLE 13 

Analysis of AIDS Patient Sera with LE-AIOS Fusion Proteins 

Western blotting analysis was used to demonstrate 
the usefulness of the different recombinant viral 
proteins produced in E. coli in detecting antibodies 
present in AIDS patients' sera. Serum samples were 
obtained random healthy blood donors, persons at high 
risk for contracting AIDS and patients suffering from 
lymphadenopathy, ARCS or AIDS. Extracts of the E. coll 
pSB8 vector alone, as a negative control, and of AIDS 
infected human lymphocytes, as a positive control, were 
electrophoresed and included in the assay of each serum. 
Figure 13 shows representative data from 19 serum samples. 

A strong correlation between positive test results 
and AIDS, ARCS, and LAV infection was demonstrated. 

Only a subset of the sera testing positive for 
antibodies to the AIDS virus and/or the ENV clone reacted 
with the GAG clones (pSB8-#45, pSB-Gag-#25 and pSB12-#31) 
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or with the POL clones (pSB12-#67, pSB12-#68 and 
pSB12-#33). 

The western blot analysis was performed as follows: 
E. coli extracts or AIDS virus-infected cell extracts 
(0.4 mg) were applied to SOS polyacrylamide gels (20 cm 
wide x' 14 cm high x 0.15 cm) and electrophoresed 
according to the method of Laemmeli ( Nature , 277 , 680 
(1970)), Western blot transfer was performed as 
described by Burnette ( Anal. Bioch. , 112 , 195-203 
(1981)). The blotted nitrocellulose (Schleicher & 
Schuell, BA83, 0.2 pore size) filters were then 
blocked, initially at 42°C for 30 min with TN buffer 
(lOmM Tris-HCl, pH 7.4, and 155 mM NaCl) containing 5* 
non-fat dry milk, and subsequently at room temperature 
for 10 min with TN-TN buffer (10 mM Tris-HCl, pH 7.4, 
155 mM NaCl, 0.3* Tween-20, 0.05* Nonidet P-40) 
containing 1* BSA. The nitrocellulose filters were then 
sliced longitudinally into strips of 1/4 inch in width. 
Serum samples were obtained from either healthy blood 
donors, individuals from "high-risk" groups, or patients 
with lymphadenopathy , ARC or AIDS. Serum from each 
individual was diluted 1000-fold in TN-TN-3X BSA and 8 ml 
were added to each separate chamber of a multi-trough 
plastic incubation tray. A set of nitrocellulose strips 
containing protein extracts derived from various E. coli 
clones or AIDS virus infected cells was placed into each 
chamber. Incubation was carried out at 4°C overnight 
with constant rocking. Upon removal of the sera, the 
strips were washed twice with 25 ml TN-TN buffer (5 
minutes each) and twice with 25 ml TN-T buffer (TN-TN 
without NP-40) . The filters were then incubated with 8 
ml of diluted radioactively labelled protein A (200,000 
dpm/ml of [ 125 I] protein A in TN-T-3S BSA). Incubation 
was carried out at room temperature for at least 
45 minutes with constant rocking. The strips were then 
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washed three times with 25 mi of TN-TN plus lOmM EDTA 
buffer followed by one wash with 25 ml of TN buffer 
(5 minutes each wash). The washed nitrocellulose strips 
were then air-dried and exposed to X-ray films. 

EXAMPLE 14 
Expression of LE-ENV and LE-GAG 

A starter culture of 50 ml LB culture media 
containing 50 ug/ml ampicillin and 100 yg/ml 
tryptophan was innoculated with E. coli cells containing 
either the pTrpLE-KAL-ENV-#10 or the pSB8-Gag-#25 
plasmid. The cultures were grown overnight, sonicated, 
centrifuged, CNBr cleaved, and extracted with HOAc and 
dialyzed as described in Example 4 above. 

EXAMPLE 15 

Immunoreactivity of CNBr-Cleaved ENV Peptides 

Western Blot analysis was used to determine the 
immunoreactivities of the various CNBr polypeptide 
fragments using anti-AIDS antibodies. Protein samples 
from induced cultures containing pTrpLE-KAL-ENV-#10 or 
pSB8-Gag-#25, with or without CNBr treatment, were 
applied to SDS-polyacrylamide gels and electrophoresed. 
The method of Burr and Burr ( Meth. Enz. , 96, 239-245) was 
used for CNBr-treated samples, and the method of Laemmeli 
( Nature , 277 , 680 (1970)) was used for untreated 
samples. Western blot transfer was performed as 
described in Example 13. 

Western blot results indicated that while CNBr 
treatment of LE-GAG fusion proteins generated by 
pSB8-Gag-#25 completely destroyed any immunoreactive 
epitope, similar chemical treatment of LE-ENV fusion 
proteins produced by pTrpLE-KAL-ENV-#10 resulted in 
peptides that retained strong immunoreactivity 
(Figure 14). In fact, these peptides appear to react 
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with anti-AIDS sera somewhat stronger immunologically 
than the intact fusion proteins. 

EXAMPLE 16 

Construction of High-Expression ENV Clone 

The above results suggested that deletion mutants of 
Kal-10 which retain the coding sequences for important 
antigenic domains would be useful for diagnostic 
applications. Deletion of extraneous gene sequences, 
such as those that code for membrane binding domains, 
often result in progeny clones that are capable of 
synthesizing proteins of interest to a higher level. 
Following this principle, we constructed a new 
plasmid (designated pENV-B54AHB) as follows: 
An 854 bp Bgl I I to BamH I fragment of 
15 pTrpLE-KAL-ENV-#10 was first cloned into the unique 
BamH I site of plasmid pS812, yielding plasmid 
pSB12-854. Plasmid pSB12-854 DNA was further cleaved 
with Hind III and BamH I. The 5 1 overhangs were then 
filled using Klenow fragment and the resulting blunt ends 

20 

recircularized using T4 ligase to produce the sequence 
AAGC TGA TCC. The t ranslational termination codon (TGA) 
generated by this process terminates the LE-ENV fusion 
protein at the Hind III-BamH I fusion site. The progeny 
transf ormant, pSB12-854AHB produced on induction a 
25 discrete 35 Kd polypeptide to more than 5% of the total 
cellular protein, as observed on an SDS-polyacrylamide 
gel stained with Coomasie Blue (Figure 15A). Western 
blot analysis of the total E. coli extract of 
PSB-854AHB verified that the 35 Kd protein was the only 

30 

immunoreactive protein entity (Figure 15B). 

The 35 Kd protein was subjected to CNBr cleavage, 
following the procedure set forth above. Western blot 
analysis demonstrated that the resulting polypeptide 
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fragments retain excellent immunoreactivity with 
anti-AIDS antibodies. 

EXAMPLE 17 

HPLC Fractionation of CNBr-ENV Peptide Fragments 

The CNBr-cleaved fragments obtained in Example 16 
were applied to a 0.46 x 25 cm HPLC column packed with 
C18 silica beads (Vydac TP201, N-capped, 5 uM) t which 
had been washed with 20% solvent B (CH 3 CN ♦ 0.1% 
CF 3 CQ0H). Solvent A was 0.1% aqueous CFjCOOH. The 
peptide fragments were eluted at a flow rate of 
1.5 ml/min. with a gradient of 20% to 40% solvent B over 
a period of 30 min. , followed by a gradient of 40% to 80% 
solvent B over an additional 20 min. The column 
fractions were collected and assayed immunologically by 
Western blot. The fractions which eluted between 34 and 
40 minutes after injection contained immunoreactive ENV 
fragments (Figures 16A and 16B). 

EXAMPLE 18 
Western Blot With AIDS Patient Sera 

Western Blot analysis was used to demonstrate the 
usefulness of the ENV-derived CNBr peptides in detecting 
anti-AIDS antibodies present in patient sera. Results 
indicated that, similar to the LE-ENV fusion proteins 
produced from pTrpLE-KAL-ENV-#10, pSB12-854, and 
pSB12-854AHB, CNBr fragments of SB12-854AHB served as 
ideal diagnostic markers for AIDS virus infection. 
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Claims 



1. A bacterial expression vector for the 
expression of fusion proteins, which vector comprises: 

a TrpLE DNA gene fragment; 

an AIDS virus DNA sequence; 

optionally a three-frame linker segment 
between said TrpLE fragment and said AIDS virus DNA 
sequence; and 

optionally a cleavable linker sequence between 

10 

said TrpLE fragment and said AIDS virus DNA sequence. 

2. The vector of Claim 1 wherein said cleavable 
linker encodes Met, Asp-Pro, or Asp-Asp-Asp-Asp-Lys * 

15 

3. The vector of Claim 1 wherein said AIDS virus 
DNA sequence is the ENV gene or a portion thereof, which 
preferably encodes the polypeptide KAL-10. 

4. The vector of Claim 3 which is pTrpLE-KAL, 

20 

pTrpLE-KAL-ENV-*10, pSB12-854, or pSB12-854AHB . 

5. A method for producing a bacterial expression 
vector for the expression of fusion proteins, which 
method comprises: 

selecting a plasmid containing a TrpLE DNA 

gene; 

cleaving the TrpLE DNA gene with a restriction 
endonuclease ; and 

inserting an AIDS virus DNA sequence. 

30 

6. The method of Claim 5 which further comprises 
inserting a cleavable linker sequence before or after 
inserting said AIDS virus sequence. 
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7. The method of Claim 6 wherein said cieavable 
linker sequence encodes Met, Asp-Pro, or 
Asp-Asp-Asp-Asp-Lys. 

8. A method for producing a bacterial expression 
vector for the expression of fusion proteins, which 
method comprises: 

selecting a plasmid containing a TrpLE DNA 

gene; 

cleaving the TrpLE ONA gene with a restriction 

endonuclease ; 

inserting a three frame linker; and 
inserting an AIDS virus DNA sequence within 

the three frame linker. 

9. The method of Claim 8 wherein said AIDS virus 
DNA sequence is KAL-10 or a portion thereof. 



10. The method of Claim 8 wherein said AIDS virus 
2Q DNA sequence comprises KAL-A, KAL-B, analogs or subsets , 
of KAL-A or KAL-B, or mixtures thereof, wherein KAL-A and 
KAL-B have the following sequences: 

Kal-A : 

Y Asp Asn Trp Arg Ser Glu Leu Tyr Lys 

25 Tyr Lys Val Val Lys He Glu Pro Leu Gly Val 

Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin 

Arg Glu Lys Arg Ala Val Gly He Gly Ala Leu 

Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr X; 

Kal-B: 



30 





Z 


Leu 


Thr 


Val 


Gin 


Ala 


Arg 


Gin 


Leu 


Leu 


Ser 


Gly 


He 


Val 


Gin 


Gin 


Gin 


Asn 


Asn 


Leu 


Leu 


Arg 


Ala 


He 


Glu 


Ala 


Gin 


Gin 


His 


Leu 


Leu 


Gin 


Leu 


Thr 


Val 


Trp 


Gly 


He 


Lys 


Gin 


Leu 


Gin 


Ala 


Arg 


He 


Leu 


Ala 


Val 


Glu 


Arg 


Tyr 


Leu 


Lys 


Asp 


Gin 


Gin 


Leu 


Leu 


Gly 


He 


Trp 


Gly 


Cys 


Ser Gly 


Lys 


Leu 


He 


Cys 


Thr 


Thr 


Ala 


Val 


Pro 


Trp 


Asn 


Ala 


Ser 


Trp 


Ser 


Asn 


Lys 


Ser 


Leu 


Glu 


Gin 


He 


Trp 


Asn 


Asn 


X. 
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wherein X is hamoserine lactone or its 
hydrolysis or amidation products, methionine, or 
another naturally occurring amino acid, Y is 
Met-Arg or Arg and Z is Met-Thr or Thr. 

11. The method of Claim 8 which further comprises 
5 inserting a cleavable linker sequence. 

12. The method of Claim 8 wherein said cleavable 
linker sequence encodes Met, Asp-Pro, or 
Asp-Asp-Asp-Asp-Lys . 

10 

13. A fusion protein, which comprises: 
a TrpLE polypeptide fragment; 

an AIDS virus polypeptide, or a subset or 
analog thereof; 
15 optionally a cleavable linker oligopeptide 

between said TrpLE polypeptide fragment and said 
AIDS virus polypeptide; and 

optionally a three-frame linker oligopeptide 
segment between said TrpLE polypeptide fragment and 
20 said AIDS virus polypeptide. 



14. The fusion protein of Claim 13 wherein said 
cleavable linker oligopeptide is Met, Asp-Pro, 
Asp-Asp-Asp-Asp-Lys. 

15. The fusion protein of Claim 13 wherein said 
AIDS virus polypeptide is an ENV protein or a fragment 
thereof, preferably KAL-10. 
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16. An AIDS polypeptide fragment, which comprises 
the following sequence, its immunoreactive subsets and 
analogs: 



Y Asp Asn. Trp Arg Ser Glu Leu Tyr Lys Tyr 

5 Lys Val Val Lys lie Glu Pro Leu Gly Val Ala 

Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg 

Glu Lys Arg Ala Val Gly He Gly Ala Leu Phe 

Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr X; 

wherein X represents homoserine lactone or its 
hydrolysis or amidation products, methionine, or another 
10 natural or synthetic amino acyl residue and Y is Met-Arg 
or Arg. 



17. An AIDS polypeptide fragment, which comprises 
the following sequence, its immunoreactive subsets and 
analogs : 



Z Leu Thr Val Gin Ala Arg Gin Leu Leu Ser 
Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg 
Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg 
20 He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys 
Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala 
Ser Trp Ser Asn Lys Ser Leu Glu Gin He Trp 
Asn Asn X; 

where X represents homoserine lactone or its 
25 hydrolysis or amidation products, methionine, or another 
natural or synthetic amino acyl residue and Z is Met-Thr 
or Thr. 
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18. An immunoassay reagent for binding antibodies 
from humans exposed to AIDS virus, wherein said reagent 
is a fusion protein which comprises 

a TrpLE polypeptide fragment; 

an AI0S virus polypeptide; and 

optionally a three-frame linker oligopeptide 

segment between said TrpLE oligopeptide fragment and 

said AIDS virus polypeptide. 

19. The immunoassay reagent of Claim 18 wherein 
said AIDS virus polypeptide is an ENV protein or a 
fragment thereof, preferably KAL-10. 

20. An immunoassay reagent for binding antibodies 
from humans exposed to AIDS virus, wherein said reagent 
is an AIDS polypeptide sequence, or an analog or subset 
thereof, which sequence comprises 

Y Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 

Lys Val Val Lys He Glu Pro Leu Gly Val Ala 

Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg 

Glu Lys Arg Ala Val Gly He Gly Ala Leu Phe 

Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr X; 

where X represents homoserine lactone or its 
hydrolysis or amidation products, methionine, or another 
natural or synthetic amino acyl residue and Y is Met-Arg 
or Arg. 
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21. An immunoassay reagent for binding antibodies 
from humans exposed to AIDS virus, wherein said reagent 
is an AIDS polypeptide sequence, or an analog or subset 
thereof, which sequence comprises 



Z Leu Thr Val Gin Ala Arg Gin Leu Leu Ser 
Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg 
Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg 
He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys 
Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala 
Ser Trp Ser Asn Lys Ser Leu Glu Gin lie Trp 
Asn Asn X; 

where X represents homoserine lactone or its 
hydrolysis or amidation products, methionine, or another 
natural or synthetic amino acyl residue and Z is Met-Thr 
or Thr. 
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22. An immunoassay reagent for binding antibodies 
from humans exposed to AIDS virus, wherein said reagent 
is a mixture of two or more AIDS polypeptide sequences, 
or analogs or subsets thereof, which sequences comprise 

5 V Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 

Lys Val Val Lys He Glu Pro Leu Gly Val Ala 
Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg 
Glu Lys Arg Ala Val Gly lie Gly Ala Leu Phe 
Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr X; 

and 

tO Z Leu Thr Val Gin Ala Arg Gin Leu Leu Ser 

Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg 
Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg 
He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys 
Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala 
Ser Trp Ser Asn Lys Ser Leu Glu Gin He Trp 

15 Asn Asn X; 

where X represents homoserine lactone or its 
hydrolysis or amidation products, methionine, or another 
natural or synthetic amino acyl residue, Y is Met-Arg or 
20 A *g and 1 is M et-Thr or Thr, 

23. The use of a fusion protein comprising: 
a TrpLE polypeptide fragment; 

an AIDS virus polypeptide; 

optionally a cleavable linker oligopeptide 

25 

between said TrpLE polypeptide fragment and said 
AIDS virus polypeptide; and 

optinally a three-frame linker oligopeptide 
segment between said TrpLE polypeptide fragment and 
said AIDS virus polypeptide; 

for the manufacture of a preparation inducing 
antibody formation in mammals. 
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1 24. The use of Claim 23 wherein said AIDS virus 

polypeptide is an ENV polypeptide or a fragment thereof, 
preferably KAL-lCh 

5 25. The use of Claim 23 or 24 wherein said anti- 

bodies are polyclonal or monoclonal and are useful for 
detecting the presence of AIDS virus* 

26. The use of Claim 23 or 24 wherein said anti- 
10 bodies are neutralizing antibodies useful for conferring 

immunity to infection by AIDS virus. 

27. The use of an AIDS polypeptide fragment com- 
prising KAL-A, KAL-B, immunoreac tive subsets or analogs 

15 of KAL-A or KAL-B or mixtures thereof, wherein KAL-A or 
KAL-B have the following sequences: 

Kal-A : 

Y Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 

Lys Val Val Lys He Glu Pro Leu Gly Val Ala 

Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg 

20 Glu Lys Arg Ala Val Gly He Gly Ala Leu Phe 

Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr X; 

Kal-3 : 

Z Leu Thr Val. Gin Ala Arg Gin Leu Leu Ser 

Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg 

Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg 

He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 

Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys 

Leu lie Cys Thr Thr Ala Val Pro Trp Asn Ala 

Ser Trp Ser Asn Lys Ser Leu Glu Gin He Trp 
Asn Asn X; 



25 



30 



where X represents homoserine lactone, methionine, 
or another natural or synthetic amino acyl residue, Y 
is Met-Arg or Arg and Z is Met-Thr or Thr, 
for the manufacture of a preparation inducing antibody 
formation in mammals. 
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1 28. The use of an AIDS polypeptide fragment com- 

prising KAL-A, KAL-B, immun ore active subsets or analogs 
of KAL-A or KAL-B, or mixtures thereof, wherein KAL-A and 
KAL-B have the following sequences: 

5 

Kal-A : 

/ Y Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr 
Lys Val Val Lys He Glu Pro Leu Gly Val Ala 
Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg 
Glu Lys Arg Ala Val Gly He Gly Ala Leu Phe 
Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr X; 

10 Kal-B ; 

Z Leu Thr Val Gin Ala Arg Gin Leu Leu Ser 
Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg 
Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg 
He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys 
1R Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala 

AO Ser Trp Ser Asn Lys Ser Leu Glu Gin He Trp 

Asn Asn X; 

where X represents homoserine lactone, methionine, or 
another natural or synthetic amino acyl residue, Y is 
Met-Arg or Arg and Z is Met-Thr or Thr 

for the manufacture of a medicament for reducing or preven- 
ting AIDS virus infection in a mammal by competitively 
binding to cellular viral receptors. 
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29. The use of a fusion protein comprising 
a TrpLE polypeptide fragment; and 

an AIDS virus polypeptide recognized by a 
mammalian cell virus receptor 

for the manufacture of a medicament for re- 
ducing or preventing AIDS virus infection in a mammal by 
competitively binding to cellular viral receptors. 

30. A vaccine Which comprises a fusion protein 
according to any one of Claims 13 to 15 or a portion there 
of and a pharmaceutical^ acceptable excipient, preferably 
an adjuvant. 
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31. A medicament for reducing or preventing AIDS 
virus infection in a mammal by competitively binding to 
cellular viral receptors, which comprises a fusion protein 
comprising 

a TrpLE polypeptide fragment; and 
an AIDS virus polypeptide recognized by a 
mammalian cell virus receptor. 
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mRNA from Plasmid pTrp-Sst II-HuGRF 

50 

AAGTTCACGT AAAAAGGGTA TCGACA ATG AAA GCA ATT TTC GTA CTG AAA GGT TCA CTG GAC 

Met Lys Ala lie Phe Val Leu Lys Gly Ser Leu Asp 

100 

AGA GAT CTC GAC AGC CGT ATT GAA CTG GAA ATG CGT ACC GAT CAT AAA GAG CTG TCT GAA 
Arc Asp Leu Asp Ser Arg He Glu Leu Glu Met Arg Thr Asp His Lys Glu Leu Ser Glu 

150 

CAT CTG ATG CTG GTT GAT CTC GCC CGT AAT GAT CTG GCA CSC ATT TGC ACC CCC GGC AGC 
His Leu Met Leu Val Asp Leu Ala Arg Asn Asp Leu Ala Arg He Cys Thr Pro Gly Ser 

200 

CGC TAC GTC GCC GAT CTC ACC AAA GTT GAC CGT TAT TCC TAT GTG ATG CAC CTC GTC TCT 
Arg Tyr Val Ala Asp Leu Thr Lys Val Asp Arg Tyr Ser Tyr Val Met His Leu Val Ser 

250 300 
CGC GTA GTC GGC GAA CTG CGT CAC GAT CTT GAC GCC CTG CAC GCT TAT CGC GCC TGT ATG 
Arg Val Val Gly Glu Leu Arg His Asp Leu Asp Ala Leu His Ala Tyr Arg Ala Cys Met 

350 

AAT ATG GCG ACG TTA AGC GGT GCG CCG AAA GTA CGC GCT ATG CAG T*A ATT GCC GAG GCG 
Asn Met Gly Thr Leu Ser Gly Ala Pro Lys Val Arg Ala Met Gin Leu He Ala Glu Ala 

Sst II 400 



GAA GGT CGT CGC CGC GGG ATG GTC CAT ATG TAC GCT GAT GCT ATA TTC ACT AAC TCT TAC 
Glu Gly Arg Arg Arg Gly Met Val His Met TYR ALA ASP ALA ILE PHE THR ASN SER TYR 

450 

CGC AAG GTT CTG GGC CAA TTA TCA GCA CGC AAG CTT CTG CAG GAC ATT CTT AGT CGA CAA 
ARG LYS VAL LEU GLY GLN LEU SER ALA ARG LYS LEU LEU GLN ASP ILE LEU SER ARG GLN 

500 

CAG GGC GAA TCT AAC CAG GAG CGT GGC GCC AGA GCC AGA CTG TGA TCC GTCGGATGTC 
GLN GLY GLU SER ASN GLN GLU ARG GLY ALA ARG ALA ARG LEU 

550 600 
CAAAAGGTCG CCACTTTTCC ACTTCCTGTT TCATTCTCGT TCCTTAAGAA AGTT AC AC AG GAATAGTATC 

650 

GAATCCTTGC CGAGTTTGAC TCAATTAGTT CAGTCAGTTT CAGGATATTA GTCATCTCTA CATTGATTAT 

700 750 
GA GTA TTC AG AAATTCCTTA AATATTCTGA CAAATGCTCT TTCCCTAAAC TCCCCCCATA AAAAAACCCG 

"760 

CCGAAGCGGG TTTTT 
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mRNA from Plasmid pTrp-BssH II-HuGRF 

50 

AAGTTCACGT AAAAAGGGTA TCGACA ATG AAA GCA ATT TTC GTA CTG AAA GGT TCA CTG GAC 

Met Lys Ala He Phe Val Leu Lys Gly Ser Leu Asp 

100 

AGA GAT CTC GAC AGC CGT ATT GAA CTG GAA ATG CGT ACC GAT CAT AAA GAG CTG TCT GAA 
Arg Asp Leu Asp Ser Arg He Glu Leu Glu Met Arg Thr Asp His Lys Glu Leu Ser Glu 

150 

CAT CTG ATG CTG GTT GAT CTC GCC CGT AAT GAT CTG GCA CGC ATT TGC ACC CCC GGC AGC 
His Leu Met Leu Val Asp 'Leu Ala Arg Asn Aso Leu Ala Arg He Cys Thr Pro Gly Ser 

200 

CGC TAC GTC GCC GAT CTC ACC AAA GTT GAC CGT TAT TCC TAT GTG ATG CAC CTC GTC TCT 
Arg Tyr Val Ala Asp Leu Thr Lys Val Asp Arg Tyr Ser Tyr Val Met His Leu Val Ser 

250 300 
CGC GTA GTC GGC GAA CTG CGT CAC GAT CTT GAC GCC CTG CAC GCT TAT CGC GCC TGT ATG 
Arg Val Val Gly Glu Leu Arg His Asp Leu Asp Ala Leu His Ala Tyr Arg Ala Cys Met 

350 

AAT ATG GGG ACG TTA AGC GGT GCG CCG AAA GTA CGC GCT ATG CAG TTA ATT GCC GAG GCG 
Asn Met Gly Thr Leu Ser Gly Ala Pro Lys Val Arg Ala Met Gin Leu He Ala Glu Ala 

400 

GAA GGT CGT CGC CGC GGC AGC TAC GGC GGC GCG GTA GGT TAT TTC ACC GCG CAT GGC GAT 
Glu Gly Arg Arg Arg Gly Ser Tyr Gly Gly Ala Val Gly Tyr Phe Thr Ala His Gly Asp 

450 

CTC GAC ACC TGC ATT GTG ATC CGC TCG GCG CTG GTG GAA AAC GGT ATC GCC ACC GTG CAA 
Leu Asp Thr Cys He Val He Arg Ser Ala Leu Val Glu Asn Gly He Ala Thr Val Gin 

500 

GCG GGT GCT GGT GTA GTC CTT GAT TCT GTT CCG CAG TCG GAA GCC GAC GAA ACC CGT AAC 
Ala Gly Ala Gly Val Val Leu Asp Ser Val Pro Gin Ser Glu Ala Asp Glu Thr Arg Asn 

5 ?° .BssH II m m 600 

AAA GCC CGC GCT GTA CTG CGC GCT ATG TAC GCT GAT GCT ATA TTC ACT AAC TCT TAC CGC 
Lys Ala Arg Ala Val Leu Arg Ala Met TYR ALA ASP ALA ILE PHE THR ASN SER TYR ARG 

650 

AAG GTT CTG GGC CAA TTA TCA GCA CGC AAG CTT CTG CAG GAC ATT CTT AGT CGA CAA CAG 
LYS VAL LEU GLY GLN LEU SER ALA ARG LYS LEU LEU GLN ASP ILE LEU SER ARG GLN GLN 

700 

GGC GAA TCT AAC CAG GAG CGT GGC GCC AGA GCC AGA CTG TGA TCCGTC GGATGTCCAA 
GLY GLU SER ASN GLN GLU ARG GLY ALA ARG ALA ARG LEU 

750 - 

AAG GTC GCC A CTTTTCCACT TCCTGTTTCA TTCTCGTTCC TTAAGAAAGT TACACAGGAA TAGTATCGAA 

800 850 
TCCTTGCCGA GTTTGACTCA ATTAGTTCAG TCAGTTTCAG GATATTAGTC ATCTCTACAT TGATTATGAG 

900 

TATTCAGAAA TTCCTTAAAT ATTCTGACAA ATGCTCTTTC CCTAAACTCC CCCCATAAAA AAACCCGCCG 
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